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ABSTRACT 


This  research  deals  with  the  specification,  analysis  and 
evaluation  of  some  routing  and  topology  design  procedures  for  large 
store  and  forward  packet  switched  conputer  communication  networks.  The 
procedures  studied  are  an  extension  of  present  techniques  and  rely  on  a 
hierarchical  clustering  of  the  network  nodes. 

Hierarchical  adaptive  routing  schemes  are  investigated  in  the 
context  of  large  networks.  In  particular,  optimal  clustering  structures 
are  determined  so  as  to  minimize  the  length  of  the  routing  tables  used 
by  the  routing  function.  TTie  effect  of  this  reduction  of  routing  infor¬ 
mation  on  the  message  path  length  is  evaluated.  Queueing  models  are 
developed  in  order  to  evaluate  the  del ay- throughput  performance  of  the 
hierarchical  routing  policies  and  to  compare  them  with  present  schemes. 
The  models  prove  the  infeasibility  of  present  routing  procedures  for 
large  networks  and  demonstrate  the  remarkable  efficiency  of  hierarchical 
routing  schemes. 

These  queueing  models  represent  an  extension  of  Kleinrock's 
model  for  networks  in  that  they  consider  nodal  storage  requirements  and 
line  overhead  due  to  routing  updates.  Furthermore,  several  buffer 
sharing  schemes  are  proposed  and  analyzed  in  order  to  optimally  utilize 
nodal  storage. 

A  hierarchical  design  methodology  is  also  presented  as  a 
solution  for  the  reduction  of  the  conputational  cost  involved  in  the 
topological  design  of  large  networks.  A  computational  cost  model  is 
developed  in  order  to  determine  that  clustering  structure  which  achieves 
a  minimal  cost.  Such  optimal  structures  are  found  which  lead  to  very 
significant  savings  in  confutation. 
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GHAPTER  1 


INTRODUCTION 

One  of  the  main  reasons  for  the  great  interest  in  computer  net¬ 
works  is  the  considerable  economy  that  can  be  achieved  through  resource 
sharing  [RODE  70],  Among  such  resources  we  include  computer  power  for 
load  sharing,  specialized  hardware,  specialized  software,  data  banks, 
etc. 

Such  networks  are  called  distributed  computer  communication  net¬ 
works  and  they  made  their  first  appearance  with  the  ARPANET  [HEAR  70] , 
[KLEI  70] ,  [FRAN  70] ,  [CARR  70]  ,  [ROBE  70] . 

Computer  networks  are  also  emerging  as  very  efficient  means  for 
data  communication  between  remote  locations.  The  first  comnercial  data 
carrier,  TELENET,  is  already  operational. 

At  the  root  of  this  booming  demand  for  conputer  networks  is  the 
ever  increasing  need  for  conputer  and  data  communication  powers.  It  is 
projected  that  by  1980  approximately  a  quarter  million  terminals  will  be 
in  use  in  Europe  [EURO  73],  [PETE  73]  and  as  many  as  four  or  five  million 
in  the  United  States  [KLEI  76] . 

A  very  important  component  of  the  network  is  the  communication 
subnetwork.  This  includes  the  hardware  and  software  specifically  dedi¬ 
cated  to  the  transfer  of  data  from  node  to  node.  Many  alternative 
communication  schemes  can  be  implemented  at  the  subnet  level.  Among 
these  are:  circuit  switching  [PORT  71],  packet  switching  (a  form  of 
store -and -forward  coinnunication)  [KLEI  64],  radio  broadcasting  [ABRA  70], 
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satellite  communication  [LAM  74],  or  any  combination  of  the  above,  etc. 

The  selection  of  the  best  scheme  is  a  difficult  problem  and 
depends  very  much  on  the  nature  of  the  traffic  to  be  handled  by  the  net¬ 
work  [CI/)SS  72A,  72B],  [MIYA  75].  The  bursty  nature  of  computer  traffic, 
as  well  as  the  continuously  decreasing  cost  of  computer  hardware 
[ROBE  74],  very  much  favor  the  packet  switching  as  the  technology  for  us 
to  consider. 

The  basic  concepts  and  the  first  packet  switching  computer  net¬ 
work  were  developed  by  the  United  States  Department  of  Defense  Advanced 
Research  Projects  Agency  (ARPA) .  This  network  (ARPANET),  in  operation 
since  1969,  has  been  an  enormously  successful  demonstration  of  the 
packet  switching  technique.  It  has  resulted  in  the  appearance  of  a 
multitude  of  other  networks  throughout  the  world  (the  NPL  network  in 
England,  CYCLADES  in  France,  etc.). 

Present  computer  networks  may  be  characterized  as  small  to 
moderate  sized  networks  (57  nodes  for  the  ARPANET  as  of  December  1975). 
The  predictions  mentioned  above  indicate  that,  in  fact,  large  networks 
of  the  order  of  hundreds  (or  even  possibly  thousands)  of  nodes  are 
imminent. 

In  the  course  of  developing  the  ARPA  network,  a  design  method¬ 
ology  has  evolved  which  is  quite  suitable  for  the  efficient  design  of 
small  and  moderate  sized  networks  [FRAN  72],  [GERL  73A] .  Unfortunately 
the  cost  of  conducting  the  design  is  prohibitive  if  these  same  tech¬ 
niques  are  extrapolated  to  the  large  network  case.  Indeed,  not  only 
does  the  cost  of  design  grow  exponentially  with  the  network  size,  but 
also  the  cost  of  a  straightforward  adaptive  routing  procedure  becomes 


prohibitive.  Other  design  and  operational  procedures  (techniques)  must 
be  found  which  handle  the  large  network  case  and  such  techniques  form 
the  subject  of  this  dissertation. 

1. 1  Routing  for  Packet-Switching  Networks 

In  a  packet  switching  network,  messages  are  partitioned  into 
segments  called  packets  which  then  are  transmitted  through  the  network 
using  the  store- and- forward  switching.  That  is,  a  packet  traveling  from 
source  S  to  destination  D  is  received  and  "stored"  in  queue  at  any  inter¬ 
mediate  node  K  while  awaiting  transmission,  and  is  then  sent  "forward" 
to  node  P,  the  next  node  on  the  route  from  S  to  D,  when  channel  (K,P) 
permits. 

The  selection  of  the  next  node  P  is  made  by  a  well-defined 
decision  rule  referred  to  as  the  routing  policy.  Several  classification 
schemes  have  been  devised  to  characterize  routing  policies  [KLEI  64], 
[FULT  72],  [GERL  73B] ,  [MCCO  75],  (WCQU  74].  Generally  speaking,  routing 
policies  may  be  divided  into  two  main  classes:  deterministic  and  adap¬ 
tive.  While  deterministic  routing  is  more  attractive  to  use  at  the 
design  phase,  adaptive  policies  are  essential  for  the  operation  of  the 
network.  A  main  objective  of  this  dissertation  is  to  specify  and 
evaluate  adaptive  routing  policies  for  large  networks. 

The  major  goal  of  adaptive  routing  policies  is  to  sense  changes 
in  the  traffic  distribution  and  network  status  and  then  to  route  messages 
such  that  the  con jested  and  damaged  areas  of  the  network  are  avoided. 

As  a  consequence,  they  adjust  to  load  fluctuations  and  node  or  branch 
failures.  The  most  commonly  used  adaptive  policies  base  their  decision 


on  routing  information  (delay  estimate,  excess  capacity  estimate,  hop 
number  estimate)  stored  in  tables  at  each  node  of  the  network.  These 
tables  identify  the  output  line  to  select  for  each  destination  in  the 
network.  They  are  updated  periodically  or  asynchronously,  or  a  com¬ 
bination  of  both,  using  routing  information  collected  internally  (at 
each  node)  and/or  provided  from  neighboring  nodes. 

For  large  computer  networks  (on  the  order  of  many  thousands  of 
nodes  or  more) ,  the  length  of  the  routing  table  (which  directs  the  traf¬ 
fic  through  each  node)  will  grow  lineraly  (one  entry  per  node)  with  the 
number  of  nodes,  and  therefore  the  storage  required  to  contain  this  list 
in  each  node  will  be  extremely  costly.  Also,  as  a  direct  consequence  of 
these  large  table  lengths,  the  cost  of  interchanging  routing  information 
among  the  network  nodes  will  also  grow  and  will  represent  a  significant 
burden  on  the  communication  lines  themselves.  All  these  considerations 
suggest  that  a  reduction  of  routing  table  length  of  some  sort  is  called 
for. 

The  main  idea  in  reducing  the  table  length  is  to  provide  at  any 
node's  (say  i)  routing  table,  one  entry  per  destination  node  for  those 
nodes  which  are  close  to  i  (in  terras  of  a  hop  distance  or  some  nearness 
measure)  and  one  entry  per  set  of  nodes  for  those  nodes  located  further 
away  from  i.  The  size  of  this  set  may  increase  with  the  (average)  dis¬ 
tance  from  i  to  the  set  of  nodes.  This  partitioning  of  nodes  into  sets 
may  be  realized  through  a  hierarchical  clustering  of  the  nodes.  Consider 
a  large  distributed  network  and  assume  that  we  can  realize  the  grouping 
of  nodes  into  clusters,  clusters  into  superclusters,  etc.  Indeed,  let  us 
assume  that  there  is  to  be  an  m-level  hierarchy.  Provided  such  a 
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clustering  structure,  a  routing  table  at  any  node  (say  i)  will  contain 
an  entry  for  each  node  in  the  same  cluster  as  i,  and  an  entry  per  clus¬ 
ter  in  the  same  supercluster  as  i,  etc.,  thus  achieving  a  reduction  of 
routing  information. 

It  is  such  a  scheme  that  we  intend  to  define,  analyze,  and  dis¬ 
cuss.  Fultz  [FULT  72],  McQuillan  [MCQU  74]  and  others  proposed  similar 
schemes  but  did  not  provide  any  quantitative  or  experimental  analysis 
as  we  do  here. 

1.2  The  Design  of  Conyuter  Networks 

In  our  previous  considerations,  we  were  given  a  large  distrib¬ 
uted  computer  communication  network  and  the  problem  was  to  devise  an 
appropriate  adaptive  routing  scheme  which  would  operate  efficiently  with 
a  fairly  small  amount  of  routing  information.  Now,  going  one  step  back, 
we  are  interested  in  designing  the  topology  of  a  large  network  under 
some  cost  and  performance  constraints. 

Several  different  formulations  of  the  design  problem  related  to 
the  communication  subnetwork  can  be  found  in  the  literature  [FRAN  72], 
[GERL  73A].  Generally  they  correspond  to  different  choices  of  perfor¬ 
mance  measures,  design  variables  and  constraints.  Here,  we  select  the 
following  very  general  formulations. 

Given;  Node  locations 

Channel  capacity  options 

Minimize:  Total  communication  cost 


Over:  Topology 

Channel  capacities 
Routing  policies 
Subject  to:  Delay  constraint 

Reliability  constraint 
Traffic  requirement 

Along  those  formulations,  several  solutions  have  been  proposed  and 
applied  to  ARPA-like  network  designs.  However,  for  networks  with  more 
than  a  few  hundred  nodes,  we  recognized  that  present  procedures  become 
prohibitively  expensive  because  of  the  large  amount  of  computer  time  and 
storage  needed  to  perform  the  optimization  step.  Design  procedures, 
based  also  on  a  hierarchical  clustering  of  the  network  nodes,  have  been 
proposed  [FRAN  73],  [GERL  73B]  [COVI  74]  to  substantially  reduce  the 
design  cost.  Again  consider  a  set  of  nodes,  initially  not  connected  and 
assume  that  we  wish  to  cluster  the  nodes  into  groups  which  themselves 
will  be  clustered  into  higher  level  groups,  and  so  on  up  to  a  specified 
number  of  hierarchical  levels.  Once  the  hierarchical  classes  are  de¬ 
fined,  then  the  previously  developed  network  design  techniques  for 
moderate  sized  networks  may  be  used  to  design  each  cluster  level  separ¬ 
ately. 

Several  questions  arise  as  to  the  optimal  clustering  structure, 
the  decomposition  of  the  global  performance  variables  and  requirements 
which  then  lead  to  a  set  of  smaller  design  problems.  Frank  and  others 
[FRAN  72],  [FRAN  73]  showed  from  a  feasibility  study  of  a  1000  node 
network  that  indeed,  hierarchical  structures  are  desirable  for  the  design 
of  large  networks.  They  also  posed  the  same  questions  concerning  the 


clustering  structure,  but  failed  to  answer  them  for  the  general  case  of 
cin  arbitrary  number  of  hierarchical  levels.  Such  questions  will  be 
addressed  in  this  dissertation. 


1 . 3  Summary  of  Results 

Hie  above  considerations  demonstrate  the  need  for  some  new 
design  and  operational  (routing)  procedures  for  the  large  network  case. 
They  also  suggest  that  methods  based  on  some  sort  of  decomposition 
scheme  appear  to  be  desirable  for  both  the  routing  and  the  design  of 
large  networks. 

The  goal  of  this  dissertation  is  twofold.  First,  we  develop 
analytic  models  with  which  we  can  predict  and  optimize  the  performance 
of  a  hierarchical  routing  in  large  distributed  networks.  Second,  we 
define  and  optimally  specify  the  decomposition  step  in  a  hierarchical 
design  of  large  networks. 

Chapter  2  is  primarily  concerned  with  the  introduction  of  the 
hierarchical  routing  schemes  and  their  underlying  hierarchical  clustering 
structure  as  solutions  to  the  reduction  of  the  routing  information  and 
its  associated  overhead.  An  optimal  clustering  structure  is  found  to 
minimize  the  length  of  the  routing  table  and  consequently  it  results  in 
a  minimum  cost  routing  scheme.  Enormous  gains  can  be  achieved  whereby 
the  table  length  may  be  reduced  from  N  (N  =  number  of  nodes)  to  e  In  N. 
However,  a  shortcoming  of  the  reduction  in  routing  information  is  the 
increase  in  the  path  length  of  a  message  in  the  network. 

In  Chapter  3.  we  examine  the  effect  of  hierarchical  routing  on 
network  path  length.  Bounds  are  derived  to  evaluate  the  maximum  increase 
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in  path  length  for  a  given  table  reduction.  The  boimds  demonstrate  a 
major  result;  in  the  limit  of  a  very  large  network,  enormous  table  re¬ 
duction  may  be  achieved  with  no  significant  increase  in  network  path 
length. 

The  reduction  in  table  length  means  that  more  channel  capacity 
and  storage  are  available  for  the  transmission  of  data  traffic  in  the 
network.  However,  this  gain  in  communication  power  may  have  to  be 
partially  or  completely  paid  back  to  handle  the  excess  traffic  caused  by 
longer  network  paths . 

In  Chapter  4,  we  address  the  above  issue.  In  particular,  we 
are  interested  in  the  trading  relations  among  the  table  reduction,  the 
nodal  storage,  the  channel  capacity,  the  network  size,  the  throughput 
and  the  delay.  Several  queueing  models  are  developed  to  capture  and 
exhibit  the  interrelationships  among  these  variables.  The  models  demon¬ 
strate  that  for  some  reasonable  cost  and  performance  constraints,  and 
for  a  class  of  symmetrical  and  distributed  networks,  routing  procedures 
under  their  present  form  (non-hierarchical)  become  infeasible  for  a  net¬ 
work  size  beyond  some  "critical"  value;  on  the  other  hand,  hierarchical 
routing  with  an  optimally  selected  table  length,  preserves  a  remarkably 
good  network  performance  for  a  phenomenal  range  of  the  network  sizes. 

The  particular  niimerical  examples  show  that  the  transition  point  where 
hierarchical  routing  becomes  certainly  better  than  a  non-hierarchical 
one  occurs  for  a  network  size  between  100  and  200  nodes. 

The  queueing  models  developed  in  this  chapter  represent,  in  a 
major  part,  an  extension  of  Kleinrock's  model  for  networks  [KLEI  64]. 
They  provide  us  with  some  new  results  concerning  the  effect  of  updates 
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and  storage  on  network  performance. 

In  Chapter  5  we  address  the  following  two  issues:  (i)  the 
assignment  of  nodes  to  clusters,  clusters  to  superclusters,  etc.,  given 
an  arbitrary  network  and  clustering  structure;  (ii)  the  evaluation  of 
hierarchical  routing  as  applied  to  a  more  general  network  environment 
such  as  the  ARPANET.  In  this  chapter  we  mainly  lay  the  framework  and 
introduce  a  methodology  in  defining  nearness  measures  between  pairs  of 
nodes.  A  clustering  technique  (Complete  Linkage  Method)  has  been  modi¬ 
fied  in  order  to  fit  the  constraints  of  our  specific  network  environment. 
The  nearness  measures  and  the  clustering  technique  are  utilized  to 
achieve  the  clustering  of  the  June  1973  ARPANET. 

Furthermore,  the  simulation  of  a  64-node  torus  net  confirms  our 
theoretic  results  and  shows  that  even  for  such  a  moderate  sized  network, 
an  appropriately  selected  hierarchical  routing  begins  to  exhibit  iirprove- 
ments  in  network  performance  as  compared  to  a  non-hierarchical  scheme. 

In  Chapter  6  we  address  some  issues  related  to  the  hierarchical 
design  of  large  networks.  The  emphasis  is  on  the  determination  of  a 
clustering  structure  to  be  used  in  the  design  phase  and  which  minimizes 
the  computational  cost  of  the  design.  Such  a  cost  is  assumed  to  grow 
exponentially  with  the  number  of  nodes  in  the  subnet  to  be  designed.  We 
present  optimum  results  both  for  the  number  of  clusters,  superclusters, 
etc.,  and  the  number  of  hierarchical  levels  when  the  same  design  strategy 
(technology)  is  considered  at  all  levels.  Optimal  clustering  structures 
are  also  determined  when  different  design  strategies  are  considered, 
provided  that  the  number  of  levels  is  given.  An  expression  of  the 
average  delay  of  a  message  in  such  a  hierarchical  network  is  also 
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provided  in  terms  of  the  average  delays  in  the  layer  subnets  composing 
the  network.  This  decomposition  should  consequently  lead  to  the  design 
of  smaller  subnetworks  for  which  we  can  utilize  present  design  strategies. 

In  Chapter  7  we  give  some  concluding  remarks  and  suggest  topics 
for  further  research. 

Some  intermediary,  but  general,  results  were  required  which 
proved  to  be  quite  useful  in  the  analytic  performance  evaluation  of 
hierarchical  routing  in  large  networks  (Chaper  4).  These  results  form 
the  object  of  ^pendixes  A  and  B. 

In  Appendix  A  we  derive  a  closed  form  expression  for  the  average 
shortest  path  length  in  grid-type  networks.  Also  the  distribution  and 
the  corresponding  z- transform  of  path  lengths  in  a  torus  network  are 
defined  and  determined. 

In  Appendix  B  we  consider  various  schemes  for  sharing  a  pool  of 
buffers  among  a  set  of  communication  channels  in  a  single  network  node 
environment  [KAMO  76] .  Severa*  sharing  schemes  are  examined  and  the 
results  of  the  analysis  are  presented  and  displayed  in  a  fashion  which 
permits  one  to  establish  the  tradeoffs  among  blocking  probability, 
utilization,  throughput  and  delay.  The  study  shows  that  no  one  scheme 
is  always  optimal,  and  that  the  selection  of  a  particular  scheme  should 
fit  the  particular  application  considered.  In  general,  we  find  under 
normal  operating  conditions  that  sharing,  with  some  restrictions  on  the 
contention  of  space,  is  certainly  desirable,  especially  when  little 
storage  is  available. 

In  Appendix  C  we  present  some  proofs  and  complementary  results 
for  Chapter  6. 
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Even  though  this  research  addressed  some  design  issues  which 
arise  in  large  computer  communication  networks,  the  methodology  devel¬ 
oped  may  be  applicable  to  more  general  and  large  systems.  In  particular 
we  mention  a  possible  application  to  packet-radio  networks,  where  clus¬ 
tering  of  terminals  in  groups  may  alleviate  some  of  the  difficulties 
encountered  in  performing  the  routing  function. 

In  summary,  we  list  below  the  major  contributions  of  this 

research. 

(1)  We  characterize  and  evaluate  the  performance  of  hierar¬ 
chical  routing  for  large  packet  switched  coin>uter  networks;  models  are 
developed  to  determine  clustering  structures  which  lead  to  a  minimal 
routing  cost  (storage,  capacity)  and  their  effect  on  the  network  path 
length.  More  importantly,  we  can  determine  the  effect  of  table  reduc¬ 
tion  in  terms  of  network  throughput  and  delay.  As  a  result,  we  are  able 
to  demonstrate  that  hierarchical  routing  is  remarkably  efficient  (in 
fact,  necessary)  for  large  distributed  networks. 

(2)  A  general  methodology  is  specified  for  the  design  of 
large  hierarchical  networks.  A  model  is  developed  to  determine  a  parti¬ 
tioning  structure  so  as  to  minimize  the  computational  cost  involved  in 
the  design  of  such  networks. 

(3)  Models  are  developed  to  study  the  effect  of  storage  in  a 
single  node  and  in  a  network  environment.  Several  storage  sharing 
schemes  are  specified  and  analyzed  in  order  to  make  best  use  of  such  a 
resource  (storage) . 


CHAPTER  2 


HIERARCHICAL  ROUTING  PROCEDURES 
FOR  LARGE  COMPUTER  NETWORKS 


The  most  commonly  used  distributed  adi^tive  routing  policies 
base  their  decisions  on  routing  information  (delay,  hop  numbers) 
stored  in  tables  at  each  node  of  the  network.  These  tables  identify 
the  output  line  to  select  for  each  destination  in  the  network.  They 
are  also  regularly  updated  using  routing  information  collected  inter¬ 
nally  (at  each  node)  and/or  provided  from  nei^boring  nodes. 

For  large  computer  networks,  on  the  order  of  many  thousands  of 
nodes,  the  routing  information  may  become  excessively  costly  in  terms 
of  nodal  storage  required  to  keep  the  tables,  CPU  time  needed  to  per¬ 
form  the  tq)dates,  and  channel  utilization  incurred  by  the  exchange  of 
routing  information.  These  considerations  make  current  adaptive 
routing  policies  unattractive  for  large  networks. 

In  this  chapter,  first,  we  will  review  the  iiiq)ortant  aspects  of 
adaq>tive  routing  procedures  for  con5)uter  networks,  then  we  will  discuss 
general  problems  associated  with  the  impact  of  the  network's  size  on 
those  procedures.  Next,  routing  schemes,  based  on  hierarchical  parti¬ 
tioning  of  the  network,  are  presented  as  a  solution  for  reducing  the 
amount  of  routing  information  and  the  associated  overhead.  The  focus 
of  this  chapter  is  to  determine  the  clustering  (partitioning)  structure 
which  results  in  a  minimum  amount  of  routing  information  and,  conse¬ 
quently,  in  a  minimum  cost  routing  scheme.  Unfortunately,  the  gains 
(Stained  from  such  schemes  are  accon^anied  with  an  increase  in  the  path 


length  of  a  message  in  the  network  which  results  in  a  degradation  of 
performance.  This  issue  will  be  the  object  of  the  next  chapter. 

2.1  Important  Aspects  of  Adaptive  Routing  Schemes 

2.1.1  The  Routing  Problem  in  Networks 

In  a  packet  switching  networlc,  messages  are  partitioned  into 
smaller  segments  called  packets  which  then  are  transmitted  through 
the  network  using  the  store-and-forward  technique.  That  is,  a  packet 
traveling  from  source  S  to  destination  D  is  received  and  "stored"  in 
queue  at  any  intermediate  node  K,  while  awaiting  transmission,  and  is 
then  sent  "forward"  to  node  P,  the  next  node  on  the  route  from  S  to  D, 
when  channel  (K,P)  permits  [KLEI  64],  [FULT  72],  [KAHN  72],  [MCQU  74] 
[KLEI  76]. 

The  selection  of  the  next  node,  P,  is  made  by  a  well-defined 
decision  rule  referred  to  as  the  routing  policy.  Routing  policies  may 
be  divided  into  two  main  classes:  deterministic  and  adaptive  [GERL  73C], 
[FULT  72].  A  deterministic  policy  is  time  invariant.  It  may  be 
selected  to  provide  the  optimal  routing  for  a  network  in  steady  state, 
which  has  a  given  traffic  requirement  governed  by  a  certain  probabil¬ 
istic  distribution  (see  Chapter  6  for  more  details),  and  under  the 
assunptions  that  no  failures  can  occur.  The  optimal  policy  may  be 
determined  analytically  [FRAT  73]  and,  therefore,  it  is  attractive  to 
use  it  in  the  design  phase  [GERL  73A].  An  adaptive  policy  is  time 
varying  and  bases  its  decision  on  some  measure  of  the  observed  traffic. 

It  can  adjust  to  load  fluctuations  and  node  or  branch  failures.  For 
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such  a  reason  it  is  more  attractive  for  real  network  operation  than 
a  deterministic  policy.  However,  the  performance  evaluation  of  an 
adaptive  policy  requires  time-consuming  simulations,  which  makes  it 
inadequate  for  use  in  the  network  design  phase. 

2.1.2  Adaptive  Routing  Policies 

The  major  goal  of  an  adaptive  routing  procedure  is  to  sense 
changes  in  the  traffic  distribution  and  network  status  and  then  to 
route  messages  such  that  the  congested  areas  of  the  network  are  avoided. 
It  is  very  important  for  those  procedures  to  ad^t  to  line  and  node 
failures  in  order  to  maintain  a  good  grade  of  service  for  the  network. 
Such  policies  base  their  decisions  on  the  measured  values,  at  given 
times,  of  a  set  of  time  varying  variables  (number  of  messages  enqueued, 
number  of  hops,  etc.)  which  describe  the  salient  features  of  the  state 
of  the  network  (traffic,  topology,  etc.).  Such  information  is  referred 
to  as  routing  information.  A  central  node  could  provide  the  routing 
information  (centralized  control)  and  distribute  it  to  all  the  nodes 
in  the  network,  or  the  nodes  could  collaborate  in  computing  the 
routing  information  directly  (distributed  control)  [KLEI  64],  [FULT  72], 
[KAHN  72]. 

Specifically,  routing  information  is  stored  in  tables  at  each 
node  and  is  used  to  identify  the  output  line  for  each  destination. 

More  detailed  classifications  of  the  routing  policies  can  be  found  in 
[FULT  72],  [GERL  73C]  and  [M(X)U  74].  In  this  study,  we  limit  our 
considerations  to  the  distributed  routing  policies  which  base  their 
decisions  on  routing  information  contained  in  routing  tables 


individually  maintained  at  each  node.  The  tables  are  i^dated  peri¬ 
odically  or  asynchronously  or  a  combination  of  both  [FULT  72]  using 
routing  information  collected  internally  and  provided  from  neighboring 
nodes.  Such  a  scheme  is  used  to  operate  the  ARPANET  [MCQU  74]. 

Typically,  in  a  network  with  N  nodes,  each  node  ("IMP"  in  the 
ARPANET  terminology)  i,  (i  =  1,  2,  ...,  N) ,  has  a  routing  ^able  (to  be 
denoted  by  RT)  which  is  composed  of  N  entries.  Each  entry,  say  k,  is 
subdivided  into  three  (or  more)  fields.  TTie  "delay"  field  indicates 
the  estimated  minimal  delay  from  node  i  to  node  k.  The  "next-node" 
field  indicates  the  next  node  a  message  must  be  forwarded  to  on  its 
way  to  node  k,  using  the  "minimal"  delay  path.  The  "hop"  field  rep¬ 
resents  the  minimum  number  of  line  hops  to  node  k.  The  purpose  of  the 
hop-field  is  to  allow  the  detection  of  line  or  node  failures  in  the 
network . 

Each  node  periodically  (i.e. ,  every  .64  sec  in  the  ARPANET, 
for  a  heavily  loaded  50  kilobit  line)  sends  to  and  receives  update 
messages  from  neighboring  nodes;  these  updates  are  not  synchronized 
among  nodes.  Upon  reception  of  an  update,  a  node  updates  its  own 
routing  table,  using  the  delays  measured  on  its  output  lines  and  the 
delay  information  found  in  the  update  message.  An  exanple  of  an  up¬ 
dating  rule  is  provided  in  Chapter  5  and  is  used  in  a  program  developed 
to  simulate  computer  networks. 

To  summarize,  let  us  say  that,  fundamental  to  the  operation  of 
the  distributed  adaptive  routing  schemes  is  the  storage,  the  propa¬ 
gation  and  the  iq)dating  of  routing  tables.  Also,  it  is  important  to 
note  that  in  such  schemes,  the  routing  tables  must  contain  a  number  of 


entries  equal  to  the  number  of  nodes  in  the  network. 

For  large  distributed  networks,  on  the  order  of  hundreds  or 
thousands  of  nodes,  the  adaptive  routing  policies  in  their  present  form 
become  unsuitable.  This  comes  abbut  for  the  following  reasons: 

i.  The  excessive  amount  of  storage  required  at  each  node 

to  store  the  routing  tables  will  substantially  reduce  the  storage  avail¬ 
able  for  other  functions  such  as  store- and- forward,  flow  control  and 
reassembly  functions.  Either  that  or  extra  nodal  cost  is  incurred  if 
more  storage  must  be  provided. 

ii.  The  large  CPU  overhead  required  for  the  maintenance  of 
the  routing  tables  (frequent  computations  of  the  best  routes)  will 
induce  either  large  delays  in  the  forwarding  of  messages  or  higher 
nodal  cost  for  faster  CPU's. 

iii.  If  no  added  capacity  is  provided,  the  channel  capacity 
required  for  the  transmission  of  the  routing  messages  could  become  so 
demanding  as  to  significantly  reduce  the  throughput  (load  carried)  of 
the  network. 

iv.  The  large  delays  incurred  in  the  propagation  of  the 
routing  information  throughout  the  network  will  reduce  the  degree  of 
adaptability  of  the  routing  scheme  to  changes  in  the  network.  In  order 
to  maintain  a  good  response  time  in  avoiding  congested  areas  it  is 
necessary  to  increase  the  rate  of  updates  which,  consequently,  will 
require  more  CPU  and  channel  capacity. 

In  view  of  the  above  considerations,  routing  schemes  bsised  on 
a  hierarchical  clustering  of  the  network  are  proposed  as  solutions  to 
keep  the  amount  of  routing  information,  hence  the  storage,  the  CPU  and 
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the  line  overhead  associated  with  it,  within  reasonable  limits  without 
deteriorating  the  performance  of  the  network.  With  respect  to  (iv)  , 
because  of  the  smaller  routing  tables,  it  is  possible  to  increase  the 
update  rate  without  much  extra  cost.  Fultz  [FULT  72]  and  McQuillan 
[MCQU  74]  proposed  similar  schemes  but  did  not  provide  any  quantitative 
analysis. 

2.2  m- level  H^ierarchical  Routing  (MHR)  Schemes 

Because  of  the  underlying  hierarchical  partitioning  of  the  net¬ 
work,  these  schemes  are  referred  to  as  the  MIR  schemes. 

The  main  idea  in  reducing  the  table  length  (routing  information) 
is  to  keep  at  any  node,  say  i,  complete  routing  information  about  nodes 
which  are  close  to  i  (in  terms  of  a  hop  distance  or  some  nearness 
measure) ,  and  lesser  information  about  nodes  located  further  away  from 
i.  This  can  be  realized  by  providing  one  entry  per  destination  for 
the  closer  nodes,  and  one  entry  per  set  of  destinations  for  the  remote 
nodes.  The  size  of  this  set  may  increase  with  the  (average)  distance 
from  i  to  the  nodes  in  the  set.  A  similar  concept  underlies  the 
mechanisms  of  large  information  systems  with  pyramidal  structures, 
in  which  information  is  more  and  more  aggregated  as  we  move  up  to  the 
higher  levels  in  the  hierarchical  organization.  Aggregation  of  infor¬ 
mation  or  variables  is  commonly  introduced  when  dealing  with  large 
systems  [MESA  70],  [CHUR  68],  [SCHO  71]. 

For  the  routing  in  large  nets,  the  aggregation  of  the  routing 
information  is  achieved  through  a  hierarchical  partitioning  of  the 
network.  Basically,  an  m-level  liierarchical  flustering  (MHC)  of  a  set 


of  nodes  consists  of  grouping  the  nodes  (which  we  shall  define  as  0^^ 

St 

level  clusters)  into  1  level  clusters,  which  in  turn  are  grouped  into 

2''^  level  clusters,  etc.  This  operation  continues  in  a  bottom  up 

fashion  until  the  grouping  of  the  m-2”*^  level  clusters  into  m-1^^  level 

th  th 

clusters  whose  union  constitute  the  m’"  level  cluster.  The  level 
cluster  is  the  highest  level  cluster  and  as  such  it  includes  all  the 
nodes  of  the  network.  Hie  MHC  will  be  described  more  formally  in 
Section  2.3.1. 

Since  MIR  schemes  are  based  on  an  m-level  hierarchical  cluster¬ 
ing  of  the  nodes  of  the  network,  only  one  entry  in  the  routing  table, 
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at  any  node,  say  i,  is  provided  for  each  node  in  the  same  1  level 

st 

clusters  as  i,  and  for  each  1  level  cluster  (set  of  nodes)  in  the 
same  2”^  level  cluster  as  i,  and  in  general  for  each  k-1^^  level 
cluster  in  the  same  k^^  level  cluster  as  i  (k  =  1,  2.,  ...,  m) .  The 
structure  of  this  scheme  can  be  best  understood  by  an  example.  Fig.  2.1 
shows  a  3-level  hierarchical  clustering  imposed  on  a  24  node  network. 

The  clustering  leads  to  the  tree  representation  shown  in  Fig.  2.2,  where 
nodes  are  identified  using  the  Dewey  notation  [KNUT  69] .  To  each  node 
is  now  associated  a  reduced  routing  table.  Fig.  2.3  shows  the  entries 
of  node  1.1. I's  routing  table;  the  number  of  entries  is  now  10  instead 
of  24  without  clustering.  As  an  example,  the  routing  of  a  packet  from 
node  1.1.1  to  node  3.2.2  may  proceed  as  follows;  Node  1.1.1  recognizes 
from  the  address  of  the  destination  node,  3.2.2,  that  it  has  to  use  entry 
3,  of  the  2*'*^  level  cluster  entries,  to  decide  upon  the  next  node  to 
which  the  packet  must  be  forwarded.  When  the  packet  reaches  a  node,  say 
3.1.1,  in  the  2^^  level  cluster  3,  then  that  node  will  in  turn  use  the 
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second  entry  among  the  1  level  cluster  entries.  Finally,  when  the 
packet  enters  the  destination  cluster,  3.2,  the  routing  will  be  done 
using  the  0^^  level  cluster  entry  number  2.  Notice  that  it  was  assumed 
that  the  MIC  results  in  connected  subgraphs. 

From  the  above  considerations,  the  following  problems  must  be 
resolved  at  the  outset: 

i.  The  determination  of  an  appropriate  clustering  structure, 
i.e.,  the  size  of  the  clusters  at  all  levels  and  the  number  of  levels 

ii.  The  definition  of  an  aggregate  routing  variable  for  the 
clusters  at  all  levels;  the  specification  of  new  updating  rules,  if 
necessary 

iii.  The  assignment  of  the  nodes  (0^^  level  clusters)  to  1^^ 

S  t  Ild 

level  clusters,  1  level  clusters  to  2  level  clusters,  etc. 

The  rest  of  this  chapter  will  focus  on  (i).  (ii)  and  (iii)  ' 
are  respectively  treated  in  Chapters  3  and  5. 

Finally,  it  is  also  necessary  to  evaluate  the  performance  of 
the  WIR  schemes  as  compared  with  the  present  non-clustered  schemes. 

This  question  will  be  examined  in  various  ways  in  Ch«q)ters  S,  4,  and  5. 

2.3  Minimum  Routing  Information 

The  hierarchical  partitioning  of  the  network  has  as  an  objective 
the  reduction  of  the  size  of  the  routing  table.  It  is  then  important 
to  determine  the  specific  clustering  sttructure  (cluster  sizes,  number 
of  levels)  that  will  result  in  a  minimum  table  length.  The  optimal 
sizes  found  will  serve  as  the  input  parameters  to  the  clustering  tech¬ 
niques  (see  Chapter  5)  whose  function  is  to  assign  nodes  to  clusters. 


clusters  to  superclusters,  etc.  In  what  follows,  we  will  introduce  the 
tree  representation  for  the  ^f^C  and  formally  pose  the  problem  of  find¬ 
ing  an  optimal  clustering  structure.  We  will  then  proceed  with  the 
derivation  of  the  optimal  solution  and  the  study  of  its  characteristics. 

2.3.1  Tree  Representation  for  the  >fiC  and  Problem  Statement 

Any  hierarchical  classification  scheme  lends  itself  to  a  tree 
representation  [KNUT  69].  The  tree  structure  has  already  been  intro¬ 
duced  in  Fig.  2.2,  to  represent  the  3-level  hierarchical  clustering  of 
the  24-node  network  in  Fig.  2.1.  The  representation  of  an  m-level 
hierarchical  clustering  is  shown  in  Fig.  2.4. 

2.3.1. 1  Notation  and  Definitions 
Definitions ; 

k  level  cluster,  Cj^,  is  defined  recursively  as  a  set  of 
k-1  level  clusters.  It  corresponds  to  a  node  at  level  k,  in  the  tree 
representation  of  Fig.  2.4. 

A  k^^  level  cluster  is  identified,  similar  to  the  Dewey  notation 

[KNUT  69],  by  a  vector  of  predecessors,  ij^^^  =  (i^,  ijjj_j.  •••> 

which  can  subsequently  serve  as  an  address  of  Cj^.  The  index,  i^, 
s  t 

indicates  the  m-1  level  cluster,  say  which  Cj^  belongs; 

i_.  ,  indicates  the  m-2”*^  level  cluster  in  C  ,  (i  )  to  which  C,  belongs; 
m-1  m-1^  m  k  * 

etc.  The  notation,  i^,  o*”  Sc^~k+1^’  will  be  used 

when  there  is  a  need  to  identify  Cj^. 

Notice  that  a  leaf  in  the  tree  representation  corresponds  to  a 
node  (0^^  level  cluster)  in  the  network,  and  to  any  node  is  associated 
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an  address  vector  which  will  be  vised  for  the  routing  of  messages. 

The  degree  of  a  level  cluster,  Cj^,  is  defined  as  the  number 
of  k-1  level  clusters  included  in  Cj^.  It  also  indicates  the  downward 
degree  of  the  corresponding  node  in  the  tree. 


Notation  and  further  definitions 


RT  =  Routing  Table 

WC  =  m-level  Hierarchical  Clustering 

NHR  =  m-level  Hierarchical  Routing 

S  I  Set  of  nodes  in  the  network 

N  =  size  of  the  above  set 

m  =  number  of  levels  in  th®  hierarchy 

Cj^  =  k^  level  cluster,  represented  by  a  node 
at  level  k  in  the  tree  structure 

Cq  =  0^^  level  cluster,  network  node 
~k+l  '  ^^m’  Vl’  •••’  Vl^  “  vector  of  predecessors 
C^Cik+P  ^  S^^m’  Vl^  '  completely  identified  k^^ 

level  cluster 

”k^~k+l^  =  •••»  “  degree  of  the  above  cluster 

”m  “  d®gree  of  the  m^^  level  cluster  -  (root  of  tree) 


jQj^  =  vector  of  degrees  of  all  k^^ 

~k+l  level  clusters 

S  =  (l!i»  S2*  •••»  degree  vector 


2. 3. 1.2  Expressions  for  the  length  of  the  Routing  Table  and  the  Size 
Constraint 

St 

The  sumnation  of  the  degrees  of  all  the  1  level  clusters 
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gives  the  total  number  of  nodes  in  the  network  (i.e.,  the  total  number 
of  leaves  in  the  tree  structure,  Fig.  2.4).  Hence 

"m  "k^V  •••’  VP  "2^V  ^3^ 

Eq.  (2.1)  will  generally  serve  as  a  constraint  over  the  choice  of  the 
optimal  degree  vector  and  it  will  be  referred  to  as  the  size 
constraint. 

Let  ^-ECqC^j)]  be  the  length  of  the  RT  at  node  C^Cij);  it  is 
defined  as  the  number  of  entries  in  that  table.  Then 

MCo(i,)l  .  n,(l, . 1,*,)  (2.2) 

The  assumption  is:  each  node  of  the  network,  0^(1^),  contains  an  RT 
with  an  entry  for  each  k-1*^  level  cluster  in  the  same  k^  level  cluster 
as  CqC^j)  (there  are  nj^(ij|»  such  entries),  and  this  for 

k  *  1,  2,  ...,  m. 

In  order  to  siii;)lify  the  manipulation  of  the  RT's,  we  will 
also  assume  that  equal  length  tables  are  provided  at  all  nodes.  Con¬ 
sequently,  if  £  is  that  length,  it  must  accomodate  the  number  of  entries 
in  the  RT  of  any  node. 

Hence 

£(m,  n)  a  max  \  i »  •••»  ^^.3) 

lover  all!  (k»l  ^ 

\  nodes  / 
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2. 3. 1.3  Problem  Statement 


given  : 


minimize  :  &(m,  n) 

over  :  m  and  n 
subject  to  :  size  constraint 


[see  Eq.  (2.3)] 


(2.4) 


size  constraint  [see  Eq.  (2.1)] 

m  a  positive  integer  variable 
n  a  vector  of  positive  integer  variables 


2.3.2  Real-Valued  Solution  of  the  Optimization  Problem 


We  proceed  to  first  solve  Problem  2.4,  with  the  assumption  that 
is  a  real  valued  vector.  This  is  in  order  to  obtain  an  explicit 
analytical  expression  for  the  optimal  solution.  As  a  consequence  of 

”2 

this  assumption,  a  summation  of  the  type  ^  becomes  meaningless 

i2=l 

if  n2  is  not  at  i^tteger,  unless  all  the  nj(i2)*s  are  equal  (to  n^)  then 
the  summation  becomes,  n2nj.  In  fact,  the  solution  of  the  optimization 
problem  will  show  that  clusters  of  the  same  level  must  be  of  equal 
degree,  hence,  all  the  summations  in  Eq.  (2.1)  will  become  meaningful 
a  posteriori. 


2. 3. 2.1  Optimality  for  a  fixed  m 


Proposition  2.1 

Given  m,  the  number  of  levels  in  the  hierarchy,  and  assuming 
that  n  is  a  real  valued  vector,  the  solution  of  Problem  2.4  is  such 
that: 
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(a)  All  clusters  at  all  levels,  k  =  1,  ...»  m,  are  composed 
of  the  same  nu]id>er  of  lower  level  clusters, 

. W  =  \  ^  •••'  W  ^ 

(2.5) 

s  k  =  1,  2,  ...,  m 

(b)  With  this  optimum  assignment  the  minimum  table  length  is 


A(m)  =  (2,6) 

Proof ; 

The  proof  proceeds  by  induction  on  the  number  of  levels,  m. 
First,  we  start  by  showing  that  Proposition  2.1  is  true  for 
For  m  ■  2,  the  problem  becomes: 


min  :  max  {nj{i2)  +  n2} 


over  1, 

A 


over  :  Bj  =  {nj(i2)}i  *  "2 


(2.7) 


s.t.  :  J]  ^ 


~1*  ’'2 


equivalent  to 

n 

n »  Sj ,  n2 

n  >  nj(i2)  ♦  n2 


Y,  "1(12)  “  N 
i2-l  ^ 


(2.8) 

^2  *  ^  #  • . .  f  ^2 


Bj.  n2  positive 
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Let  be  fixed.  Then,  summing  the  first  constraint  in  Problem  2.8, 
with  respect  to  n^,  we  find 


n. 


n.n  >  X) 


i2=l 


+  n. 


(2.9) 


If  n  is  feasible,  i.e.,  if  it  satisfies  Eq.  (2.1),  then  Eq.  (2.9) 
becomes 


n 


¥  n  feasible. 
0^ 


Ihe  above  equation  provides  a  lower  bound  on  the  optimal  solution  for  a 
fixed  n2.  Consequently,  if  a  feasible  solution  achieves  that  lower 
bound,  then  it  must  be  optimal.  Such  a  solution  is 


"l^^2^  ■  n2 


•  9  9  f  ^2 


(2.10) 


If  we  now  let  n2  be  a  variable,  the  problem  reduces  to 

N  ^ 

min  n  =  — 

"2  ^ 

I 

over  n2,  ”2  2^  0 
1/2 

whose  optimum  is  achieved  for  n2  =  N  '  which,  combined  with  Eq.  (2.10) 
proves  that  Proposition  2.1  is  true  for  m  =  2. 

Assuming  that  Proposition  2.1  is  true  for  up  to  m  -  1  levels, 
let  us  show  that  it  is  true  for  m  levels.  The  size  constraint, 

Eq.  (2.1),  is  equivalent  to  the  following  set  of  constraints: 
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E  •**  r  "i^V  •••' V  ' 


X.  PCiJ  =  N 
1  *1 
m 


i  »  1,  ...,  n 
in  in 


(2.11) 


(2.12) 


Let  tis  fix  the  variables  n  and  p(Of  i  «  1,  ...,  n  ,  sudi  that 

IB  n  m  n 

Eq.  (2.12)  is  satisfied.  Problem  2.4  becomes  decoiq)osable  into  n^ 
subproblems,  corresponding  each  to  a  given  value  of  the  index  i  . 


a  subproblem,  for  a  given  i  ,  is 

IB 


nan :  max  •< 

over  all  nodes  i 

in  same  m-1®^ 
level  cluster 

over:  iij,  n^,  ....  n^_j 


s.t.:  Eq.  (2.11)  must  be  satisfied. 


(  .  .  I 

|"m*  "k^^m*  ^m-1*  •'*’  ^2^ 


From  the  induction  hypothesis,  the  solution  of  the  above  problem  is 
such  that 

1 

"k^^m*  Vl’  ^  ^  ^^m'  Vl*  •••*  ^k+1 

i_  fixed,  and 
m 

V  k  =  1,  2,  ....  m-1 


with  such  an  assignment  the  minimum  objective  is  equal  to 


(m  -  l)[p(i  )]®‘^  +  n_  . 


If  we  now  let  the  p(i  )'s  be  variable  but  we  keep  n_  fixed,  the  problem 

+  (®  -  ^Hp(i 

isi  sn  <  ®  ®  / 

m  m 

Pt^nP 

Eq.  (2.12)  must  be  satisfied  . 

The  above  problem  is  similar  to  Problem  2.7;  hence,  for  a  fixed  n^, 

I 

the  solution 

PfV  *  ^  =  * . 

n 

is  optimal.  Consequently,  we  are  left  with 

1 

N 

min  £  ■  (m  -  1)  —  ♦  n 

n  m 

m 

over  n  ,  n_  >  0 

m  m 

Differentiating  £  with  respect  to  n^,  we  find 

m 

From  the  above  equation,  we  determine  that  £  is  minimum  for 

n^  »  (2.15) 

m 


becomes 


min; 


over: 


s.t. 


and  the  minimum  value  of  £  is 


Substituting  Eq.  (2.15)  into  Eq.  (2.14),  then  Eq.  (2.14)  into  Eq.  (2.13) 
we  arrive  at  Eq.  (2.5).  Also,  Eq.  (2.16)  is  exactly  equal  to  Eq.  (2.6) 

Q.E.D. 

2. 3.2.2  Global  Optimality 

So  far  we  have  solved  for  the  optimal  clustering  when  m,  the 
nunber  of  levels  is  fixed.  We  now  intend  to  let  m  vary  and  conse¬ 
quently  solve  for  the  global  optimum.  In  other  words,  we  intend  to 
solve  Problem  2.4  in  its  entirety  except  that  the  components  of  the 
degree  vector  n  and  the  number  of  levels  m,  are  assumed  to  be  real 
variables. 

Proposition  2.2 

Under  the  conditions  of  Proposition  2.1  and  m  being  a  real 
variable,  the  global  optimum  clustering  is  achieved  for  a  number  of 
levels 

m*  =  In  N  (2.17) 


and  a  degree  vector  n* 

n*  =  e  =  2.718...  k  =  1,  2,  ...,  m^ 


The  corresponding  minimum  table  length  is 

*  e  In  N 


(2.18) 


(2.19) 


Proof; 

Proposition  2.1  gave  us  the  optimal  clustering,  for  a  fixed  m. 
Consequently  we  are  left  with  the  minimization  of  Mm),  Eq.  (2.6),  with 
respect  to  m.  Differentiating  Eq.  (2.6)  with  respect  to  m,  we  find 
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cUt(Tn)  _  /m  -  In  N  \  j^l/m 
dm  ”  \  m  / 

The  above  equation  shows  that  r(m)  is  minimum  for  m  as  given  in  Eq. 
(2.17).  Substituting  Eq.  (2.17)  into  Eqs.  (2.5)  and  (2.6),  we  arrive 
at  Eqs.  (2.18)  and  (2.19). 


2. 3. 2. 3  Duality 

A  simpler  formulation  of  problem  2.6  may  be  obtained  by  directly 
imposing  that  all  level  clusters  are  of  equal  degree,  n^ 

(k  =  1,  ...,  n) .  Hence  Problem  2.6  becomes 


I 


given :  N 

m 

min:  A  =  2^  n. 

k=l 


over:  n=  (n^,  ...,  n^),  m 
m 

s.t. :  n  n,  =  N 

k=l 

n,  m  positive  integer  valued. 


(2.20) 


It  can  be  (directly)  shown  by  induction,  that  the  real  valued  solution 
of  the  above  problem  verifies  Propositions  2.1  and  2.2. 

The  dual  formulation  of  Problem  2.20  is 


■given:  I 


max:  N  =  H  n. 
i=l 


over:  n,  m 
m 

s.t. 


2:  n,  =  £ 
k=l 


(2.21) 


n,  m 


positive  interger  valued. 


The  objective,  in  the  formulation  above,  is  to  find  the  maximum  number 
of  nodes,  N  ,  such  that  there  exists  an  MiC  whose  application  results 
in  a  routing  table  of  length  !>•  The  dual  propositions  to  2.1  and  2.2 
are  respectively. 

Proposition  2.3 

For  a  fixed  m  and  I,  the  real  valued  solution  of  Problem  2.21 
is  such  that 


nj^  =  —  k  s  1,  2,  . . . ,  m 


with  this  assignment 


« -  (if 


Proposition  2.4 


The  real  valued  global  optimum  of  Problem  2.21  is  such  that 


-  e 

"k  “  ® 

N*  =  e^/® 


*  1 ,  . . . , 


Proposition  2.3  can  be  directly  proven  by  induction  on  the 
number  of  levels  in  the  hierarchy.  Proposition  2.4  is  a  direct  con¬ 
sequence  of  Proposition  2.3. 


2. 3. 2. 6  Numerical  examples 

Figs.  2.5  and  2.6,  respectively,  illustrate  the  behavior  of 
T/N  and  T/l^  (see  Eqs.  (2.6)  and  (2.19)  with  respect  to  m  and  for 
several  values  of  N.  The  plots  exhibit  a  flat  area  around  the  minimum. 
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They  also  show  an  initial  fast  decrease  of  I  toward  a  value  close  to 

I 

the  minimum.  This  last  property  is  better  illustrated  in  Fig.  2.7 
where  T/N  is  plotted  with  respect  to  N  for  m  e  {l,  2,  ...,  In  n}  ; 
this  indicates  that  most  of  the  table  reduction  can  be  obtained  with 
hierarchical  clustering  whose  nunber  of  levels  is  quite  a  bit  smaller 
than  m^  (Eq.  (2.17)). 

2.3.3  Integer  Solution 

In  this  paragraph  we  intend  to  solve  the  integer  optimization 
problem  as  formulated  in  2.20,  except  for  the  size  constraint  which  is 
changed  to  an  inequality,  the  problem  becomes 

m 

imin:  S  n. 

k=l 

over:  n,  m  integer  valued  (2.22) 

m 

s .  t . :  n  ^  N 
k*l 

The  above  modification  is  introduced  to  avoid  dealing  with  enqpty 
feasible  sets  of  vectors  n,  for  some  values  of  m  and  N.  A  solution  n, 

“  th 

such  that  n  n,  >  N,  only  implies  that  the  degree  of  any  k  level 

k=l  ^ 

cluster,  k  =  1,  ...,  m,  must  be  less  than  or  eqxial  to  n^^.  Practically 
this  means  that  there  will  be  unused  entries  in  some  of  the  routing 
tables. 

Recall  that  the  global  optimum  real-valued  solution  is  such 
that  all  the  components  Oj^'s  are  equal  to  e,  therefore  in  the  integer 
case  we  are  not  surprised  that  the  following  proposition  holds  true. 
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Proposition  2.S 

There  exists  a  global  optimum  vector  n*  which  is  composed  of 
zero,  one  or  two  components  equal  to  2,  with  all  the  others  equal  to 
3. 


Proof 

The  idea^  is  that  any  number  (component  of  n)  or  set  of  numbers 
can  be  replaced  by  a  set  of  2's  or  3's  which  results  in  the  same  sum 
but  a  higher  product.  Hence  the  new  set  is  at  least  as  good  as  the 
original.  As  an  example,  we  list  typical  transformations. 


Original 

Numbers 

4 

5 

6 
7 

2,  2,  2 


Transformation 

Sum 

Original 

Product 

New 

Product 

2,  2 

4 

4 

4 

2,  3 

5 

5 

6 

3,  3 

6 

6 

9 

2,  2.  3 

7 

7 

12 

3,  3 

6 

8 

9 

Let  us  now  proceed  with  the  proof  by  showing  that  we  can  always 
derive  from  an  optimal  solution  which  does  not  satisfy  Proposition  2.5, 
one  which  does.  Let  n*,  m,,  be  a  solution  which  does  not  satisfy  Prop¬ 
osition  2.5;  then,  at  least  one  of  the  following  conditions  is  true: 

(1)  at  least  one  component  is  equal  to  1,  (2)  at  least  one  component 
is  greater  than  3,  finally  (3)  at  least  three  conqponents  are  equal  to 
2. 

With  respect  to  (1)  let  us  prove  by  contradiction  that  this 

situation  cannot  occur.  Assume  that  n  =1,  then  consider  the 
_ 

^  This  idea  was  suggested  by  Dr.  D.  Cantor,  Mathematics  Department, 
UCU. 
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vector  n'  composed  of  the  other  -  1  con^onents  of  n^,  i.e.. 


n,'  =  n. 


k  «  1 ,  . . . ,  m*  - 


consequently 


m^-1  m* 

m^-1 

n  "k  =  n  "k 


Hence  ij'  is  a  feasible  vector  which  results  in  a  better  objective; 
that  is  a  contradiction. 

With  respect  to  (2) ,  assume  first  that  n  >5,  then  consider 

">*  ” 

the  vector  jj*  composed  of  +  1  elements  which  are  such  that 


k  ~  1^  ro^  “  1 


n*  ,  =  n  -  3 
m*+l  m^ 


With  the  above  transformation  the  objective  function  is  obviously  un¬ 
changed.  The  product  is  such  that 


n'  n*  .1  =  n„  +  1  ♦  2n  -  10 
m^  m^  m^ 

hence  n_  2  5  -e>  n'  n'  ,  >  n  +1 

®*  «*  m*+l  —  m* 

m^+1  /m*-l  \  m* 

n  "k  ■  f  n  ”v)  "m  *1  >  n  n. 

k+l  \k-l  7  kil  ^ 

which  means  that  n'  is  also  feasible,  hence  optimal. 

"m.  *  *  choose  n»  -  n’  =  2  and  still  obtain 

a  feasible  vector  jj'.  The  above  operations  can  be  repeated  several 
times  on  the  constructed  vecton  n'  in  order  to  transform  all  the 
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coirponents  which  are  greater  than  3. 

With  respect  to  (3),  assume  that  n_  ^  =  n_  ,  =  n_  =2; 
- 1 n*-2  m*-l  m* 

then  consider  the  vector  nj  composed  of  -  1  elements  which  are  such 


that. 


With  the  above  transformation  the  objective  fuction  is  unchanged.  The 
product  of  the  components  is  increased  by  1.  As  a  result  n*  is  also 
feasible,  hence  optimal. 

The  above  operation  can  be  repeated  several  times  on  the  con¬ 
structed  vectors  n',  until  we  reach  a  solution  with  no  more  than  two 
2's. 

Finally,  we  can  see  that  the  repeated  ^plication  of  the 
transformations  shown  for  (2)  and  (3)  will  eventually  generate  a  vector 
n'  which  satisfies  Proposition  2.5. 

As  a  consequence  of  Proposition  2.5  the  search  for  the  optimal 
number  of  levels  is  reduced  to  three  possibilities.  From  Problem  2.22, 
the  optimal  m,  must  be  such  that 

3""*2^  >  N 

where  x  e  {O,  1,  2} 


Hence,  the  three  possible  values  of  m  are: 


Finally  the  optinal  ■,  ,  is  the  solution  of 


■in  :  £  *  5n  -  X 

(over  (m.x)  e  {(n^,  0),  (uij,  1),  (m2, 2)} 

Notice  that  the  optimal  pair  (m,x)  gives  the  conposition  of  the  optimal 
vector  n*. 

Proposition  2.6 

Given  m,  there  exists  an  optimal  vector  n  which  is  such  that 
no  two  conponents  differ  by  more  than  1,  and  which  is  given  by, 

k  =  2,  3,  . . . ,  m  (2.23) 

'  It  **4' 

i»k+l  ^ 

or  any  other  permutation  of  the  above  solution. 

Proof 

Let  us  show  that,  given  any  two  numbers  which  differ  by  more 

than  1,  we  can  replace  them  by  exactly  two  nuobers  which  do  not  differ 

by  more  than  1  and  which  result  in  the  same  sum  but  in  a  better  product. 

IVo  cases  to  consider  depend  on  whether  the  difference  between  the  two 

numbers  is  odd  or  even: 

1/  (even) ,  pick  n  and  n  2p. 

Their  sum  is  2n  2p. 

2 

Iheir  product  is  n  2pn. 

Let  us  replace  them  by  n  ♦  p  and  n  ^  p;  then  the  sum  is  still  the 
same,  but  the  product  is 


"k  “ 


tf-T 
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n  +  2pn  +  p 
2 

which  represents  an  inprovement  of  p  .  A  similar  proof  can  apply  for 
the  odd  case. 

From  the  above  property,  we  conclude  that  any  nj^  k  =  1, 
is  equal  to  a  given  number,  either  a  or  a  ■*-  1.  If  we  let  x  represent 
the  number  of  components  equal  to  a  1,  then  the  problem  reduces  to 

!min  i  =  (m  -  x)a  +  x(a  +  1)  =  ma  +  x 
over  (a,  x) 

s.t.  a  (a  +  1)  ^  N 

a  positive  integer;  x_5.  m,  positive  integer. 

Let  us  show  that  there  exists  at  least  one  conponent,  say  n^,  equal  to 
From  the  constraint  above,  the  optimal  a  is  such  that 

(i)  X  =  0  “>  a"  >  N  ->  a  = 

(ii)  X  0  ~>  (a  +  D®  >  (a  +  l)*a®~*^  H  a  ♦  1  *  Fn^^"! 

Knowing  that  n  =  Fn^^®1,  Problem  2,22  can  be  reduced  to  m  -  1 
m 

variables,  with  N  replaced  by  N/n  .  From  the  same  considerations  as 

m 

above,  we  know  that  the  optimal  vector  (n,,  n.,,  ...,  n  ,)  for  the  new 

X  z  m**! 

problem  has  at  least  one  conponent,  say  n^  j,  equal  to 


Repeating  the  same  operation,  we  arrive  at  Eq.  (2.23). 


Numerical  examples 

Similar  curves  as  in  Fig.  2.5  are  plotted  in  Fig.  2.8.  They 


illustrate  the  behavior  of  I  (Id)  versus  m  for  the  integer  case.  They 
exhibit  the  properties  described  in  Section  2. 3. 2. 4,  mainly  the  enormous 
table  reduction  for  small  values  of  m.  By  conoparing  Fig.  2.5  with 
Fig.  2.8  we  notice  that  the  integer  valued  solution  is  extremely  close 
to  the  real-valued  solution.  Consequently,  we  will  limit  any  further 
considerations  to  the  simple  real- valued  solution. 

Fig.  2.9  illustrates  the  behavior  of  N^/N,  where 

"o '  ii 

Notice  that  the  slack  in  the  size  constraint  is  relatively  small  for 
large  values  of  N. 

2.3.4  Optimality  with  No  *'Self-En tries"  in  the  Routing  Table 

In  the  previous  model,  at  each  routing  table,  one  entry  (to  be 
called  self-entiy)  is  reserved  for  the  node  which  contains  that  table, 
and  one  for  each  of  the  level  clusters  k  =  1,  2,  ...,  m  -  1,  to 
which  that  node  belongs.  For  some  WR  schemes  (e.g.,  those  defined 
in  Chapter  3)  and/or  with  some  extra  CPU  overhead,  the  updating 
algorithm  can  operate  without  those  self-entries.  Consequently,  the 
new  length  A*  of  the  RT's  is, 

A'  =  A  -  m  (2.24) 

where  A  is  given  by  Eq.  (2.3). 

The  optimal  clustering  structure  is  the  solution  of  Problem 

2.4  where  A  is  replaced  by  A*. 

Real  Valued  Solution 

For  a  fixed  m,  Eq.  (2.5)  still  holds  true.  Hence  the  minimum 
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length  is 


Integer  Valued  Solution 

For  a  fixed  m.  Proposition  2.6  still  holds  true.  As  for  the 
global  optimum  the  problem  to  solve  is, 

min:  ^  n.  -  m 
k*l 

over:  n,  m  integer  valued  (2.25) 

m 

s.t. :  n  n.  >  N 
k=l  * 

Recall  that  the  real  valued  solution  is  such  that 

n,  *  limit  =  l"*^ 

m  » 

Iherefore  we  are  not  surprised  that  the  f61 lowing  proposition  holds 
true. 

Proposition  2.7 

Ihere  exists  a  non- degenerate  (i.e.,  no  one  conponent  is  eqml 
to  one)  global  optimum  vector  ii*  which  is  such  that 


and 


(2.26) 

(2.27) 


Proof 

Let  us  first  show  that  if  is  an  optimal  vector  which  contains 

at  least  one  con^nent  equal  to  1,  then  the  vector  obtained  after 

deletion  of  all  the  I's  is  also  optimal. 

Assume  that  n_  =1,  then  consider  n*  such  that 
m* 

”k“”k  k=l,...,m*-l. 
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Notice  that  the  table  length  and  the  product  of  the  coiqponents  are  un¬ 
changed.  Hius  n*  is  also  feasible,  hence  it  is  optimal.  Hie  above 
operation  can  be  repeated  several  times  until  we  eliminate  all  the 
components  which  are  equal  to  one.  In  the  sequel  we  restrict  our  con¬ 
siderations  to  non- degenerate  solutions  (i.e.,  no  one  component  is  equal 


to  one) . 


Let  ust  first  prove  this  intermediary  result. 


Fact: 


If  n*  is  a  non-degenerate  optimal  vector  then  it  must  be 
conposed  of  zero,  one  or  two  components  equal  to  3  with  all  the  others 
equal  to  2. 


Proof 


The  proof  proceeds  by  contradiction.  Assume  that  n*  is  a 


non-degenerate  optimal  vector  which  does  not  satisfy  the  above  fact. 

It  must  be  such  that  at  least  one  of  the  following  conditions  is  true; 
(1)  at  least  one  conponent  of  n*  is  greater  than  or  equal  to  4;  (2)  at 
least  three  conponents  of  q*  are  equal  to  3. 


With  respect  to  (1) ,  assume  that  n  >4,  then  consider  the 


vector  b'  , 


("k  ■  "k 

(nj  .  2 


k  »  1,  2,  . . . ,  m^  -  1 

i  1,  “*  ♦  “o 


(2.28) 


where  m^  is  such  that 


2"»<„  <2V 


m-+l  m*-l 


(2.29) 


n  n*  =  2 n  n  >  n  "k 

k»l  k»l  ^  k«l 
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Consequently  q,'  is  feasible.  Also  the  objective  function  is 

B.,-1 

Y.  -  Cm*  +  hIq)  =  2  \*  2(mQ  +  1)  -  (111^  +  m J . 

Ic*  1  lC“  X 

If  we  subtract  the  above  table  length  from  the  one  obtained  with  n*, 
then  the  difference  is 

A  =  n  -  m-  -  2  (2.30) 

m*  u 

If  A  >  0,  then  we  reach  a  contradiction.  Let  us  prove  that  in  fact 
A  >  0. 

i.  n  =4"->m-=l=>A=l 
m*  0 

ii.  n  >  5  -<*  m.  >  2  . 

m*  -  fl¬ 

it  can  easily  be  shown  by  induction  that 
®0 

2  >  iDq  +1  V-  iiIq  2 

From  Eqs.  (2.29)  and  (2.30), 

“0  ®0 

A>2"-m  -  2>2“-m  -1>0 
o  —  o 

Thus  A  is  always  greater  than  zero,  which  is  a  contradiction. 

With  respect  to  (2) ,  assume  that 


n 


'  A  ”  n  « 
m*-2  m*-l 


*  n_  =3 


consider  the  vector  q,'  composed  of  m*  2  elements. 


k  ®  1^  •••y  ^ 


*  2  ^  ^ 


m*  +  2 


It  is  obvious  that  n*  is  feasible  and  that  it  reduces  the  table  length 
by  one,  which  contradicts  the  fact  that  n*  is  optimal.  Q.E.D. 
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End  of  Proof  of  Proposition  2.1 


The  rest  of  the  proof  is  a  consequence  of  the  above  fact.  Let 
n*  be  a  non- degenerate  optimal  vector.  If  n^  does  not  satisfy  Prop¬ 
osition  2.7,  then,  because  of  the  above  fact,  it  must  contain  one  or 
two  3's.  Any  3  can  be  replaced  by  two  2's,  without  changing  the 
objective  function  and  still  keeping  feasibility.  As  a  result  we  arrive 
at  a  vector  con^osed  only  of  2's  and  for  that  vector  to  be  optimal  the 
number  of  components  must  be  a  solution  of 

/  min :  m 

/  over:  m  integer 
(  s.t. :  2™  ^  N 

The  solution  of  the  above  problem  is  given  by  Eq.  (2.27),  which  term¬ 
inates  the  proof. 

2.4  Conclusions 

This  chapter  was  primarily  concerned  with  the  introduction  of 
the  ftIR  schemes  and  their  underlying  hierarchical  clustering  structure 
as  solutions  to  the  reduction  of  the  routing  information  and  its 
associated  overhead.  We  found,  indeed,  that  enormous  gains  can  be 
obtained  whereby  the  length  of  the  routing  tables  may  be  reduced  from 
N  entries  to  the  order  of  e*ln(N)  entries.  However,  a  shortcoming  of 
these  gains  is  the  increase  in  the  path  length  of  a  message  in  the  net- 
woiic.  This  comes  about  from  the  fact  that  a  given  node  must  send  all 
its  traffic  to  a  given  cluster,  on  the  same  path  to  that  cluster.  This 
path  will,  in  general,  be  optimal  only  for  a  subset  of  the  nodes  in  the 
destination  cluster.  Consequently,  some  messages  will  follow  longer 


paths  than  they  should.  Ihis  issue  will  be  addressed  in  the  next 
chapter. 

It  is  also  possible  that  less  routing  adaptability  could  result 
from  the  MiR  schemes  because  of  the  aggregation  of  the  routing  informa¬ 
tion.  This  fact  may,  however,  be  beneficial  in  our  context  of  large 
networks  where  the  routing  policy  need  not  adjust  to  very  remote  and 
probably  short  lived  fluctuations. 
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CHAPTER  3 


STATIC  EVALUATION  OF  HIERARCHICAL  ROUTING: 
BOUNDS  ON  THE  INCREASE  IN  NETWORK  PATH  LENGTH 


Built  into  the  hierarchical  routing,  proposed  in  Chapter  2,  is 
the  reduction  of  routing  information.  As  a  result  there  will  be,  in 
general,  an  increase  in  the  network  path  length.  The  magnitude  of  the 
increase  is  closely  related  to  the  length  of  the  routing  table  and  to 
the  choice  of  a  specific  aggregate  routing  variable. 

In  this  chapter,  two  hierarchical  routing  schemes  and  a  non¬ 
clustered  (non-hierarchical)  scheme  are  presented,  and  their  routes  are 
characterized  under  some  "equilibrium"  conditions.  Then,  with  some 
assumptions  on  the  network  topology  and  on  the  partitioning  structure, 
bounds  on  the  increase  in  the  network  path  length  are  derived.  These 
bounds  are  expressed  and  studied  in  terms  of  the  relative  table  length 
(Jl/N) .  They  demonstrate  an  asymptotic  result  for  a  class  of  large 
networks,  The  result  shows  that  when  the  number  of  nodes  grows  to 
infinity,  enormous  table  reduction  can  be  obtained  with  a  relatively 
insignificant  increase  in  the  network  path  length. 

3. 1  Path  Characteristics  for  Hierarchical  and  Non-Hierarchical 

Adaptive  Routing  Policies 

The  purpose  of  this  section  is  to  characterize  the  actual  or 
virtual  routes  obtained  from  the  routing  tables  under  certain  equilib¬ 
rium  conditions  as  defined  below.  The  routing  schemes  are  assumed  to 
belong  to  the  class  of  hierarchical  or  non-hierarchical  adaptive 
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policies,  considered  in  Chapter  2.  Such  policies  basically  propagate 
routing  information  describing  the  length  of  the  paths  to  reach  any 
destination  node  or  a  set  of  nodes.  The  path  length  is  defined  as  the 
sum  of  the  lengths  of  all  the  channels  which  constitute  that  path. 
Moreover,  the  length  of  a  given  channel  is  usally  a  random  variable 
which  may  reflect  the  utilization  and/or  the  excess  capacity  and/or  any 
other  information  which  partly  or  entirely  describes  the  stochastic 
state  of  that  channel.  The  transient  nature  of  adaptive  routing  renders 
the  analysis  of  the  above  problem  extremely  complicated.  In  order  to 
make  any  progress  we  will  assume  that  all  channels  are  of  constant 
length.  This  is  a  sinplifying  assumption  which  will,  however,  allow  us 
to  capture  the  effect  of  clustering  on  the  network  path  length;  this  is 
the  main  objective  of  this  chapter.  Morever  the  above  assumption  is  an 
accurate  description  of  routing  policies  which  are  only  sensitive  to 
changes  in  the  network  topology,  and  of  more  general  policies  operating 
under  light  traffic  conditions  [KLEI  64].  Furthermore  if  all  the  chan¬ 
nels  are  considered  to  be  of  equal  length  (say  1) ,  then  the  routing 
information  is  sin5)ly  what  we  defined  earlier  as  the  hop  distance. 

Such  routing  information  is  in  general  utilized  by  routing  policies  to, 
at  least,  detect  changes  in  the  network  topology. 

In  summary  we  will  restrict  our  considerations  to  hierarchical 
or  non-hierarchical  routing  schemes  (also  referred  to  as  clustered  and 
non-clustered  routing  schemes)  which  use  as  routing  information  the 
path  length  only.  Also  we  consider  that  all  channels  are  of  constant 
length.  In  what  follows  we  first  assume  that  all  channels  are  of  equal 
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length  (one  hop)  and  then  we  generalize  to  arbitrary  (constemt)  length 
channels. 

3.1.1  Non-Clustered  Routing  (NCR) 

Recall  that  the  NCR  schemes  operated  with  complete  routing 
information,  i.e.,  the  routing  tables  contain  one  entry  per  destination 
node  in  the  network.  Because  of  the  frequent  exchange  of  routing  in¬ 
formation  between  neighboring  nodes,  the  routing  tables  will  in  general 
indicate  the  next  node  in  the  shortest  path  to  any  destination.  This 
is  due  to  the  fact  that  at  each  update,  a  node  con5)ares  its  own  routes 
to  the  neighbor's  routes  and  updates  its  table  by  keeping  the  shorter 
routes.  The  above  approach  is  in  essence  a  dynamic  search  for  the 
minimal  paths  in  the  network  [HU  69],  [FRAN  71].  As  a  consequence, 
after  a  certain  number  of  updates  (less  than  or  equal  to  the  diameter 
of  the  network)  if  no  changes  occur  in  the  network  topology,  all  the 
tables  will  indicate  exactly  the  shortest  paths.  If  a  change  in  topol¬ 
ogy  occurs,  then  some  of  the  paths  will  not  be  optimal  during  a 
transitory  period  (time  for  the  routing  information  to  percolate  to  the 
concerned  areas  in  the  network) .  However  for  the  purpose  of  coin)arison 
with  the  hierarchical  routing,  we  will  assume  that  the  tables  always 
carry  the  shortest  path  information.  A  more  formal  treatment  of  the 
above  is  presented  within  the  context  of  the  clustered  schemes  since, 
as  we  will  see  in  the  sequel,  a  1-level  hierarchical  routing  is  equiv¬ 
alent  to  an  NCR  scheme. 
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3.1.2  Specification  of  the  MHR  Schemes 


Built  into  the  htlR  schemes,  considered  in  Chapter  2,  is  the 
reduction  of  the  routing  information  whereby  one  entry  in  a  routing 
table  may  be  reserved  for  more  than  one  destination  node.  Routing 
information  is  aggregated  whenever  it  is  exchanged  between  special 
nodes  in  different  clusters  at  any  level.  Such  special  nodes  will  be 
referred  to  as  exchange  nodes.  Two  MIR  schemes  will  be  presented  below. 
They  differ  only  in  the  definition  and  subsequently  the  computation  of 
the  aggregate  routing  information.  The  two  schemes  will  be  referred  to 
as  the  Closest  Entry  Routing  (CER)  and  the  Overall  Best  Routing  (OBR) 
schemes.  In  order  to  proceed  with  their  description,  we  first  need  to 
specify  the  imderlying  m-level  hierarchical  partitioning  of  the  set  of 
nodes  of  the  network. 

Assumption  3.1 

The  underlying  MiC  structure  of  the  set  of  network  nodes  is 
such  that  all  clusters  at  the  same  level  k,  are  of  equal  degree, 
nj^,  k  =  1,  ...,  m.  Also  the  subset  of  nodes  coii5>osing  a  cluster  at 
any  level  and  their  incident  channels  constitute  a  1-connected  (at 
least  one  path  exists  between  any  pair  of  nodes)  cluster  subnetwork. 

The  former  property  of  the  above  assumption  partly  satisfies 
Proposition  2.1  which  defines  the  optimal  clustering  structure  that  we 
will  eventually  use.  The  latter  property  is  necessary,  since  the  traf¬ 
fic  exchanged  between  nodes  in  the  same  cluster  must  follow  paths 
included  in  that  cluster's  subnet. 

Because  of  the  above  assumption  the  notation  of  Chapter  2 
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can  be  greatly  simplified.  Particularly,  the  degree  vector  is  reduced 
to  n  =  (Hj,  n2>  •  Moreover,  if  there  is  no  need  to  identify 

a  cluster  with  its  entire  address  vector,  then  a  shorter  notation, 
such  as  below,  can  be  used. 

4  j  entry  in  k  level  cluster  entries 
or 

4  ^  level  cluster. containing  an  arbitrary  node  s. 

As  a  consequence  of  Assumption  3.1,  the  routing  tables  at  any 

node  will  have  the  format  shown  in  Fig.  3.1,  and  will  contain 

£.  =  n,  +  n_  +  ...  +  n  entries.  Notice  that  self  entries  are  included 
1  z  m 

in  the  routing  table  (RT) .  The  self  entries  of  the  RT  at  an  exchange 
node  may  be  assigned  to  carry  the  aggregate  routing  information  from 
one  cluster  to  another.  The  content  of  the  self  entries  in  tables  at 
other  nodes  (non-exchange  nodes)  need  not  be  specified  in  this  study. 

Two  aggregation  procedures,  each  pertaining  to  a  particular  MHR  scheme 
(OBR  or  CER),  are  presented  below.  Also,  possible  ways  of  in^lementing 
those  procedures  are  suggested  with  the  assumption  that  neighboring 
nodes  exchange  their  RT's  as  routing  updates  (as  mentioned  in 
Chapter  2) . 

Aggregation  of  Routing  Information 

i.  With  respect  to  the  CER  (Closest  Entry  Routing)  scheme, 
no  routing  information  describing  the  internal  behavior  of  a  cluster 
is  propagated  outside  the  cluster.  With  this  rule,  a  cluster  is  re¬ 
garded  from  the  outside  as  a  single  (super-)  node  whose  distance  to 
itself  is  equal  to  zero.  A  way  of  implementing  this  scheme  is  to  assign 


(HOP  FIELD) 


NEXT  PATH 
NODE  LENGTH 


^  n,  0**'  LEVEL  CLUSTER 
ENTRIES 


k.1*<  LEVEL  CLUSTER 
ENTRIES 
5 


^  "k+1  LEVEL  CLUSTER 
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W  m-1**  LEVEL  CLUSTER 
ENTRIES 


Figure  3.1.  Routing  Table  at  Node  ».  with  an  m-Level  Hierarchical  auitaring. 
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zero  as  the  content  of  all  the  self  entries  at  any  RT  located  at  an 
exchange  node.  In  other  words,  the  distance  from  an  exchange  node  to 
the  clusters  at  all  levels,  to  which  it  belongs,  is  considered  to  be 
equal  to  zero. 

ii.  As  for  the  OBR  (Overall  Best  Routing)  scheme,  the  average 
estimated  distance  from  an  exchange  node  to  all  the  nodes  in  its  cluster 
(including  itself)  will  be  propagated  when  necessary  as  the  aggregate 
routing  information  relative  to  that  cluster.  A  possible  implemen¬ 
tation  is,  at  any  exchange  node's  RT,  to  assign  to  the  hop  field  of  the 

level  cluster  self  entry  the  average  of  the  contents  of  the  hop 
St 

fields  of  all  the  k-1  level  cluster  entries  in  that  RT.  When  re¬ 
quired,  the  computation  of  the  averages  must  proceed  sequentially, 
starting  from  k=ltok=m-l  (see  Fig.  3.1).  The  hop  field  of  the 

0  level  cluster  self  entry  is  always  set  to  zero.  The  interpretation 

s  t 

of  the  above  rule  is  that,  at  any  exchange  node,  say  e,  the  1  level 

cluster  self  entry  of  the  RT  indicates  the  estimated  average  distance 

s  t 

from  node  e  to  any  node  in  the  same  1  level  cluster  as  e,  including  e. 
In  general,  the  k^^  level  cluster  self  entry  indicates  the  estimated 
average  distance  (in  hops)  from  node  e  to  any  node  in  the  same  k^^  level 
cluster  as  e,  including  e. 

Notice  that  the  content  of  the  self  entry  fields  can  be  well 
computed  by  the  exchange  node  receiving  the  routing  update  or  by  the 
one  generating  the  update.  This  fact  will  become  clear  in  the  example 
treated  right  after  the  specification  of  the  update  rule. 

Update  rule 

Let  s  and  t  be  two  neighbor  nodes  (i.e.,  they  are  connected 
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th 

by  a  channel  (s,  t))  which  belong  to  the  same  k  level  cluster  Cj^  and 

not  to  any  lower  level  cluster,  (k  *  1,  2,  m) .  Let  and 

st 

Cj^  j(t)  respectively  denote  the  k-1  level  clusters  to  which  s  and  t 
belong.  From  the  condition  above,  we  know  that 


Cj^_l(s)  ,  <=  Cj^  and  C^.jCs)  n  =  <j) 


As  a  consequence,  and  due  to  the  RT  organization  as  specified  in  Chap¬ 
ter  2,  the  routing  tables  at  s  and  t  are  such  that  all  the  p-level 
cluster  entries  for  p  =  0,  1,  ...,  k  -  2  refer  to  different  cluster 
destinations,  whereas  all  the  other  entries  refer  to  the  same  cluster 
destinations. 

The  object  of  the  updating  procedure  is  to  coii5)are  the  esti¬ 
mated  lengths  of  the  paths  from  s  or  t  to  any  common  destination.  Then, 
the  routing  tables  are  ipdated  to  show  the  better  paths.  More  formally, 
let 


C.(i) 


i  =  1,  2,  ...,  n^^jl 


j=k-l,  ...,m-l 


denote  a  common  (to  s  and  t)  level  cluster  destination.  To  that 
cluster  is  associated  entry  i  (at  both  tables)  among  the  level  clus¬ 

ter  entries;  that  entry  will  also  be  denoted  by  Cj(i)  (Fj^g.  3,1).  Also 
let  HF(u,  Cj(i))  represent  the  content  of  the  hop  field  of  entry  (i) 
at  node  u  (u  =  s  or  t) .  Finally,  whenever  node  t  receives  an  update 
message  from  node  s,  then  for  each  common  destination  entry  Cj (i)  the 
following  iq)dating  algorithm  is  performed. 


IF  HF(t,  C^(i))  >  1  +  HF(s,  C^.(i)) 

THEN  HF(t,  Cj(i))  1  +  HF(s,  Cj(i)) 

NEXT  NODE  FIELD  OF  (i)  s 
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Initially  all  the  entries  are  set  to  a  large  value  (») ,  except  for  the 
self  entries.  If  a  CER  is  used  then  all  the  self  entries  are  set  to 
zero,  and  if  an  OBR  is  used  then  only  the  level  cluster  self  entries 
are  set  to  zero,  e.g. ,  at  node  s 

!HF(s,  C  (s))  a  HF(s,  s)  =  0 
0  - 

(0  CER 

HF(s,  C.  (s))  =  <  k  =  1,  in  -  1 

(  <»  OBR 

all  other  entries  =  «*> 

Notice  that  in  the  algorithm  above,  it  is  assumed  that  all  the  routing 

information  contained  in  the  non-common  destination  entries  in  node  s 

routing  table  is  aggregated  as  specified  before,  to  represent 

HF(s,  Cj^  j(s)).  Moreover  the  content  of  the  common  self  entries  is  not 

relevant. 

The  fact  that  HF(s,  Cj^.jCs))  need  only  be  conqputed  for  updating 

purposes,  makes  it  clear  that  either  the  sending  (s)  or  the  receiving 

node  (t)  can  perform  the  aggregation  of  routing  information. 

A  few  more  remarks  can  be  stated  about  the  above  updating  rule. 

i.  If  s  and  k  belong  to  the  same  1  level  cluster,  then 

their  RT's  contain  only  common  destination  entries.  As  a  result. 

Algorithm  3.1  will  be  performed  for  all  the  entries  in  the  table. 

ii.  A  unique  "degenerate"  WIR  routing  scheme  corresponds  to 

either  the  OBR  or  the  CER  schemes  with  only  one  hierarchical  level,  and 

it  is  exactly  the  non-clustered  scheme  specified  above.  Moreover,  for 

such  a  degenerate  case  all  the  network  nodes  belong  to  the  same  unique 
St 

1  level  cluster;  hence,  as  expected,  the  updating  algorithm  will  be 
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performed  for  all  the  entries  in  the  RT's. 

iii.  For  any  pair  of  nodes  s,  t,  the  common  region  in  the  rout¬ 
ing  tables  can  be  determined  by  inspecting  the  address  vectors  of  s  and 
t.  A  similar  operation  is  required  by  the  routing  function  described 
later  in  this  section. 

We  are  now  ready  to  treat  an  example  which  will  expose  the 
mechanisms  of  updates  and  aggregation. 

Example  of  update 

Let  us  consider  the  14  node  network  shown  in  Fig.  3.2. 


If  no  clustering  is  considered  then  each  node  contains  a  14-entry  rout¬ 
ing  table.  Numbers  from  1  to  14  are  assigned  to  the  nodes  to  serve  as 
addresses  (shown  on  the  right  of  each  node  in  Fig.  3.2).  If  a  hierar¬ 
chical  routing  is  used  then  the  underlying  clustering  structure  is 
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composed  of  2  clusters  Cj^(l),  0^(2)  containing  7  nodes  each,  i.e., 
ni  =  7,  n2  =  2,  m  =  2  (see  Fig.  3.2).  The  node  addresses  are  as  de¬ 
fined  in  Section  2. 3. 1.1,  and  they  are  shown  on  the  left  of  the  nodes 
in  Fig.  3.2.  With  this  clustering  structure  the  length  of  an  RT  is 
reduced  to  9. 

In  what  follows  we  will  present  the  outcome  of  the  updates  and 
a  few  key  steps.  The  initialization  is  performed  as  mentioned  above. 
After  a  certain  number  of  updates,  if  an  NCR  is  used,  all  the  RT's  will 
show  (in  the  hop  field)  the  shortest  distances  to  all  the  nodes  in  the 
network,  and  the  first  nodes  of  the  corresponding  shortest  paths  in  the 
"next -node"  field.  A  typical  RT  is  shown  in  Table  3.1. 


Destination 

Next  Node 

Hop  Field 

1 

1 

1 

2 

1 

2 

3 

1 

3 

4 

8 

2 

5 

8 

3 

6 

8 

4 

7 

8 

3 

8 

8 

1 

9 

8 

2 

10 

11 

2 

11 

11 

1 

12 

X 

0 

13 

13 

1 

14 

13 

2 

Node  12,  NCR  Used 


Table  3.1  RT  at  Node  12 
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If  an  OBR  or  a  CER  is  utilized,  then  updates  between  nodes  in 
the  same  cluster  are  performed  similar  to  the  non-cliistered  case.  As  a 
consequence,  after  a  certain  number  of  updates  the  RT's  will  show  the 


shortest  internal  (included  in  the  cluster)  paths  for  nodes  in  the  same 

cluster.  Tables  3. 2. a  and  3.2.b  show  the  outcome  of  the  updates  at  the 

exchange  nodes  routing  tables.  For  the  sake  of  clarity,  the  contents 
s  t 

of  the  1  level  cluster  self  entries  are  not  shown,  and  also,  the 
other  cluster  entries  are  left  equal  to  the  initial  value  (“) . 


Destination 


(1.1) 

0 

(1.1) 

1 

(1,3) 

3 

(1,2) 

(1,2) 

1 

0 

(1,3) 

2 

(1,3) 

(1,2) 

2 

(1,3) 

1 

(1.3) 

1 

(1.4) 

(1,2) 

3 

(1.3) 

2 

0 

(1,5) 

(1,2) 

4 

(1.3) 

3 

(1.5) 

1 

(1.6) 

(1.2) 

5 

(1,3) 

4 

(1,5) 

2 

(1,7) 

(1,2) 

4 

(1,3) 

3 

(1,7) 

1 

(1,  ) 

(2,  ) 

00 

00 

00 

at  node 

(1 

.1) 

(1 

,2) 

(1 

.4) 

Destination 


(2,1) 

0 

(2.1) 

1 

(2.1) 

B 

(2,2) 

(2,2) 

1 

(2.1) 

2 

(2,6) 

(2,3) 

(2,2) 

2 

(2,4) 

2 

(2.6) 

(2,4) 

(2,5) 

2 

(2.4) 

1 

(2,6) 

B 

(2,5) 

(2.5) 

1 

0 

(2.6) 

B 

(2,6) 

(2,5) 

2 

(2.6) 

1 

(2,6) 

1 

(2,7) 

(1,  ) 

(2,  ) 

(2.5) 

3 

00 

(2,6) 

2 

00 

0 

oo 

at  node 

(2.1) 

(2.5) 

(2.7) 

Table  3.2.b  RT's  at  the  Exchange  Nodes 


Let  us  now  look  at  the  exchange  of  iq)dates  between  nodes  in 

different  clusters.  As  an  exanople,  assume  that  node  (2,5)  receives 

update  from  node  (1,1).  The  first  7  entries  (in  their  RT's)  do  not 

refer  to  common  destinations  and  as  such  they  are  only  utilized  to 

compute  the  aggregate  routing  information.  That  information  (0  for  CER 

s  t 

average  for  OBR)  is  conputed  and  stored  in  the  1  level  cluster  self 


entry  at  node  (1,1) 's  RT,  by  either  the  sending  or  receiving  node. 
(Results  of  such  conputations  are  shown  in  Table  3.3.) 


After  the  coii5)utation  of  the  aggregate  information.  Algorithm 
3.1  is  performed.  TTie  outcome  of  this  single  step  is  shown  in  Table 
3.4  for  all  the  exchange  nodes  in  Cj(2). 


at  node  (2,1)  (2,5)  (2,7) 


Table  3.4  Intermediate  Update 


TTiis  information  recorded  in  entry  Cj(l)  in  the  above  RT's  will 
percolate  inside  cluster  0^(2).  At  one  point  (2,5)  will  receive  an 
iq)date  from  (2,1)  which  will  (if  OBR  is  used)  trigger  a  change  in 
Cj(l)'s  entry.  It  can  be  easily  checked  that  entries  reserved  to  0^(1) 
in  (2,1)  and  (2,7)  will  not  be  affected  by  i^dates  coming  from  nodes  in 
Cj(2).  As  a  result,  the  final  outcome  for  such  entries  is  as  shown  in 
Table  3.5. 
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(1,4) 

I  (2,1) 

3  +  3/7 

1 

1 

(1,2) 

3 

1 - 

1 

1 - 

1 - 

- 1 

(1,4)  1  (1.1)  1 


at  node  (2,1)  (2,5) 

Table  3.5  Final  Outcome 


(2.7) 


Notice  that  the  resulting  average  path  length  from  (2,5)  to  the  nodes 
in  Cj(l),  is 


3  +  5/7 

CER 

(computed  from  Table  3.5) 

3  +  3/7 

OBR 

(see  Table  3.5) 

2  +  4/7 

NCR 

(see  Table  3.1) 

As  for  (2,1)  and  (2.7),  that  average  path  length  remains  the  same  with 
either  one  of  the  hierarchical  schemes.  The  comparison  of  the  above 
numbers  shows,  as  expected,  that  globally  CER  will  induce  a  longer  net¬ 
work  path  length  than  OBR  which,  in  turn,  induces  a  longer  path  than 
NCR. 

Routing  Function 

The  problem  here  is  to  decide  iq>on  the  RT  entry  to  utilize  in 
order  to  forward  (or  to  send)  a  message  from  a  node  of  address 
i  =  (i„,  ij)  to  a  node  of  address  j  =  (j^,  jj) 

(see  Section  2. 3. 1.1).  Let  us  first  notice  that  if  k  is  the  level  of 
the  lowest  level  cluster  to  which  both  nodes  belong,  then  their  address 
vectors  must  match  starting  from  the  left,  up  to  and  including,  the 
index  k  *  1 ,  i.e. , 


ip  =  jp  p  =  m,  m  -  1,  ....  k  +  1 


‘k  *  jfc 


As  an  exan^le  we  compare  the  addresses  of  a  few  pairs  of  nodes  chosen 
from  the  3  level  clustered  network  shown  in  Fig.  2.1.  The  outcome  of 
the  coin)arison  is  given  in  the  table  below. 


(j3»  j2»  ij) 

k 

(1,3,3) 

(3,2,1) 

3 

(1,3,3) 

(1,2,2) 

2 

(1,3,3) 

(1.3,2) 

1 

Table  3.6  Matching  the  Address  Vectors 


Because  of  the  above  property,  the  routing  algorithm  can  then 
easily  perform  a  sequential,  from  left  to  right,  matching  of  the  address 
vectors  to  find  the  value  of  k.  Then  the  next  node  field  of  the 
entry  among  the  k-1^^  level  cluster  entries  (see  Fig.  3.1)  will  indicate 
the  next  node  on  which  to  forward  the  message. 

With  the  above  specifications  of  the  MIR  and  NCR  schemes,  we 
are  now  ready  to  address  the  question  as  to  what  is  the  content  of  the 
hop  fields  at  any  RT,  under  some  defined  equilibrium  conditions. 

3.1.3  Path  Characteristics 

If  no  changes  occur  in  the  topology  of  the  network,  after  a 
certain  number  of  updates  the  contents  of  the  hop  fields  in  the  routing 
table  will  reach  "minimal"  constant  values.  In  what  follows,  this 
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situtation  will  be  referred  to  as  equilibrium  condition.  Similar  to 


the  dynamic  programming  approach,  the  above  property  is  due  to  the  fact 
that  improvements  are  made  sequentially  at  each  update  over  the  distance 
from  one  node  to  any  cluster  (see  Algorithm  3.1).  The  question  arises 
as  to  what  is  the  meaning  of  the  routing  information  at  equilibrium,  or 
in  other  words,  what  are  the  characteristics  of  the  paths  indicated  by 
the  routing  tables.  We  have  already  noticed  that  for  the  degenerate 
one  level  hierarchical  clustering,  i.e.,  when  no  clustering  is  used, 
those  paths  correspond  to  the  shortest  paths  in  the  current  topology. 
Before  we  proceed,  a  few  more  definitions  and  notation  are  necessary. 

h^^  =  Length  of  the  estimated  minimum  path  from  node  p  to 
node  t  as  derived  from  the  routing  information  at 
node  s.  (The  superscript  c  stands  for  clustered 
routing.) 

Internal  path  »  a  path  is  defined  to  be  internal  (included)  in  a 
cluster  Cj^  if  all  the  nodes  in  that  path  belong  to 
that  cluster. 

h^^  =  Length  of  the  shortest  path  from  node  s  to  node  t 
included  in  the  lowest  level  cluster  to  which  both 
s  and  t  belong. 

Exchange  node  =  (defined  previously)  An  exchange  node  (to  be  denoted 
by  e  or  e^)  of  a  given  cluster  is  a  node  of  that 
cluster  which  is  connected  to  one  or  more  nodes 
external  to  that  clustsr. 
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a  Subset  of  all  the  exchange  nodes  which  connect 
cluster  with  any  level  cluster  which 

belongs  to  the  same  k+1  level  cluster  as 

w  .  »  Aggregate  variable  representing  cluster  C.  and  as 
eCj^  -k 

computed  from  the  routing  information  contained  at 
the  exchange  node  e  of  Cj^. 

From  the  above  definitions  and  previous  specifications  we  no¬ 
tice  first  that  a  ^letwork  node  (0^^  level  cluster)  is  its  own  exchange 


node,  second  that 


—  2  "ff 
Ic^l 


for  the  OBR  scheme 


for  the  CER  scheme 


(3.2) 


where  |  C|^  |  represents  the  number  of  nodes  in  cluster  C|^  and  f  is  an 
arbitrary  node  of  Cj^.  The  above  considerations  allow  us  to  character¬ 
ize  the  path  lengths  under  the  MIR  schemes. 

Proposition  3.1 

Let  s  and  t  be  two  arbitrary  nodes  which  belong  to  the  same  k^** 
level  cluster  Cj^  but  not  to  any  lower  level  cluster;  then  the  length  of 
the  path  from  node  s  to  node  t  as  derived  at  e<}ui  librium  from  the  routing 
information  contained  at  node  s,  satisfies  the  recursive  equation  below. 


■>»  - 


(3.3) 
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where  e^  is  an  exchange  node  of  which  is  such  that 

h^  +  w  o  *  »in  <h^  +  w  ^  .. 

®®0  OjCAj^  jCt)!  ®®j  ®jS-l^^^) 

St 

where  level  cluster  tdiich  contains  node  t,  and 

Aj^_^(t)  is  its  corresponding  subset  of  exchange  nodes  as  defined  above. 


Proof 


The  proof  proceeds  by  induction  on  the  level  k  of  the  lowest 


level  cogmon  cluster.'  In  what  follows  C^Cs)  and  Cj(t),  j  >  1,  ....  m, 
will  always  respectively  denote  the  j  level  cliisters  to  which  s  and  t 
belcmg. 
k  »  1 

s,  t  belong  to  the  sane  1^^  level  cluster  then 
CqCs)  -  ^(s)  =  {s} 

Co(t)  »  ApCt)  =  (t) 

Also,  since  the  distance  of  a  node  to  itself  is  zero,  then 

In  order  to  prove  Eq.  (3.3)  there  zemains  to  show  that 

>•»  ■  >■« 

i.e.,  that  h®^  is  the  length  of  the  shortest  path  from  s  to  t  included 
in  Cj.  This  is  true  since  the  RT  of  any  node  in  contains  an  entry 
for  node  t;  hence  at  equilibrium  we  obtain  the  minimal  internal  path 
from  s  to  t.  Notice  that  if  m  *  1,  i.e.,  the  degenerate  case,  all  nodes 
belong  to  the  same  cluster  which  corresponds  to  the  entire  set  of 
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nodes,  hence  In  other  words,  when  no  clustering  is  used 


the  routing  information  indicates,  at  equilibrium,  the  shortest  (hop) 
path;  which  checks  with  the  conclusion  reached  in  Section  3.1.1. 

Assuming  that  Proposition  3.1  is  true  up  to  k  -  1,  let  us  show 
that  it  is  true  for  k. 

Proof  for  k 


th 


Let  be  the  k  level  cluster  common  to  s  and  t.  All  the 
nodes  in  Cj^  contain  in  their  RT's  one  entry  for  cluster  Cj^  j(t).  The 
propagation  and  the  subsequent  updating  of  the  RT's  among  the  nodes  of 
C|^,  is  equivalent  to  finding  the  minimum  path,  internal  to  Cj^,  from  any 
node  in  {Cj^  -  fictitious  supemode  SCj^  j(t)  shown  in 

Fig.  3.3.  In  other  words,  seen  from  any  node  in  {Cj^  -  Cj^  j(t)}. 


®j  ^ 


Figure  3.3.  Equivalant  Raprnantation  of  Cluster  C|^.^  (t). 

cluster  is  equivalent  ,  in  terms  of  distance,  to  a  center  node 

SC|^_j(t)  connected  to  all  the  exchange  nodes  in  Aj^  j(t).  If 

Cj  £  then  the  length  of  the  equivalent  edge,  from  e^  to  the 


center  node,  is  equal  to  the  aggregate  information  representing  cluster 
Cj^  j(t)  as  seen  from  e^,  i.e.. 


i(e  ,  SC  (t))  =  _  . 

j  Tc-l  e.C^jl 


(3.5) 


Notice  if  the  CER  scheme  is  used  then  the  above  equivalent  represen¬ 
tation  reduces  to  the  single  node  SCj^  j(t).  In  general,  from  Eq.  (3.2) 


iSc-l^^^l  feC^  j(t) 


(OBR) 


(3.6) 


(CER) 


5  V  C 

Since  e.  and  f  belong  to  the  same  k-1  level  cluster  then  h  ,  is 
1  e .  r 

J 

defined  because  of  the  induction  hypothesis.  Hence,  the  above  equation 
is  known. 

If  e^  is  the  exchange  node  in  ^(t)  which  belongs  to  the 

minimal  path  from  s  to  SCj^_j(t)  obtained  at  equilibrium,  then  e^ 

satisfies  Eq.  (3.4)  which  represents  the  length  of  that  minimal  path. 

Due  to  the  routing  function,  previously  specified,  all  messages  to  be 

forwarded  or  sent  from  node  s  to  node  t  will  follow  the  same  minimal 

path  vqp  to  the  exchange  node  e^.  At  that  point  Oq  and  t  belong  to  the 
St  c 

same  k-1  level  cluster,  hence,  h  ^  is  known  from  the  induction 

®0 

hypothesis.  Consequently 


h^^  =  h^  +  ^ 

St  sej,  ept 


where  e^  satisfies  Eq.  (3.4) 


Q.E.D. 
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Remarks 


If  the  CER  scheme  is  used,  then  Eq.  (3.4)  becomes 


'0  e 


min  {h^  } 

‘  se.' 


(3.7) 


The  above  equation  indicates  that  e^  is  the  closest  exchange  node  of 
C|^  j^(t)  (for  paths  included  in  node  s.  This  explains  the 

nomenclature:  Closest  Entry  Routing. 

ii.  Arbitrary  Channel  Lengths.  Instead  of  assigning  (as 


above)  length  1  (1  hop)  to  all  the  channels,  we  intend  to  let  the  chan¬ 
nels  be  of  variable  lengths.  Let  a^^  represent  the  length  of  channel 
(i>j)*  then  the  length  of  a  path,  from  node  s  to  node  t  is  defined 


""  (ij)ev  '' 


(3.8) 


The  specifications  of  the  MIR  and  the  NCR  schemes  are  still  valid 
except  that  any  hop  distance  is  replaced  by  the  new  assigned  length. 
Hence  Algorithm  3.1  becomes 


IF  HF(t,  C.(i))  >  a^^  ♦  HF(s,  C  (i)) 

J  J 

THEN  HF(t,  Cj(i))  a^g  +  HF(s,  C^d)) 

NEXT  NODE  FIELD  OF  C^  (i)  s  END. 

c  i 

Also  if  we  let  h  ,  h  ,  and  w  „  be  as  defined  previously,  then 

Sv  SC 

Proposition  3.1  still  holds  true. 


(3.9) 


3.2 


Bounds  on  the  Increase  in  Path  Length 

The  effect  of  the  clustering  (reduction  of  routing  information) 

is  the  increase  of  the  path  length  between  any  pair  of  nodes,  s,  t, 

of  an  amount  h  ^  -  h  A  measure  of  performance  of  the  WR  schemes 
St  St 

could  be  the  relative  increase  of  the  average  path  length,  i.e., 

h 

D  =  IP  -  1  (3.10) 

where  h^  and  h  denote  the  average  path  length  in  the  network  respec¬ 
tively  with  and  without  clustering, 

(3.11) 

"c  '  J  ■'st 

Proposition  3.1  provides  a  means  for  confuting  the  values  of 
hg^  for  any  pair  of  nodes  s,  t,  for  a  given  outcome  of  the  m-level 
hierarchical  clustering  of  the  set  of  nodes,  S.  Consequently,  for 
that  particular  situation,  it  is  numerically  possible  to  evaluate  the 
relative  increase  D,  Eq.  (3.10)  and  then  coinpare  the  clustered  with  the 
non-clustered  schemes.  Moreover,  with  further  assumptions  on  the 
structure  of  the  hierarchical  partitioning  of  the  nodes,  we  can  obtain 
analytic  bounds  on  the  increase  in  the  path  length. 

Assumption  3.2 

1  th 

The  diameter  of  any  level  cluster  subnet  (see  assumption 
3.1)  is  less  than  or  equal  to  a  quantity  dj^,  k  =  1,  ...,  m. 

^Recall  that  the  diameter  of  a  network  is  the  maximum  shortest  path 
between  pairs  of  nodes  [HARA  72]. 
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Notice  that  represents  the  diameter  of  the  entire  network 
and  that  dj^  >  dj^_j  ^  0  for  all  k's. 

Assumption  3.3 

Any  cluster  at  any  level  k  =  1,  2,  m  contains  the 

shortest  path  (if  it  is  not  unique,  then  at  least  one  is  contained) 
between  any  given  pair  of  nodes  which  belong  to  that  cluster. 

Assumption  3.2  is  simply  the  specification  of  the  outcome  of 
the  clustering  of  the  nodes,  since  the  d^'s  can  be  of  any  value. 

Whereas  Assumption  3.3  is  a  natural  property  that  any  clustering  scheme 
should  seek.  Ihe  reason  for  this  is  that  traffic  between  nodes  in  the 
same  cluster  must  (because  of  the  routing  function  above)  follow  paths 
internal  to  that  cluster. 

The  above  assun^tions  lead  to  the  derivation  of  some  siiq)le 
bounds.  These  bounds,  on  the  increase  in  path  length,  pertain  to  the 
routing  schemes  (OBR,  CER)  described  in  Section  3.1.  All  the  prop¬ 
erties  listed  below  rely  on  Assumptions  3.1  and  3.2.  If  Assunption  3.3 
is  used,  it  will  be  so  specified. 

Lemma  3.1 

Uhder  the  above  conditions,  the  value  of  h^^  for  any  pair  of 

nodes  s,  t  which  belong  to  the  same  k*’*'  level  cluster  is  such  that 

V  s,  t  e  same  k^^  level  cluster 
d.  (3.12) 

Vk=l,  2,  ...,m 

Proof 

The  proof  proceeds  by  induction  on  k.  First  if  k  =  1  then 
similar  to  the  proof  of  Proposition  3.1  (for  k  =  1)  =  h^^.  Then 
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i  c 

froB  Assumption  3.2,  ^  d^,  hence  ^  dj  which  checks  Eq.  (3.12). 

Now  assuming  that  Lemna  3.1  is  true  for  all  levels  iq>  to  k  -  1,  let  us 
show  that  it  is  true  for  k. 

Let  s,  t  be  a  pair  of  nodes  which  belong  to  the  same  k^*'  level 
cluster  Cj^.  Let  Cp  be  the  lowest  level  cluster  to  which  both  s  and  t 
belong,  hence  p  If  p  <  k  then  from  the  induction  hypothesis. 


< 


which  checks  Eq.  (3.12).  Else  if  p  =  k,  then  Eq.  (3.3)  holds  true.  In 
Eq.  (3.3),  because  of  Assumption  3.2 


s,  e.  e  C.  h_  <  d, 
0  k  sCq  —  k 


Also,  because  of  the  induction  hypothesis 


k-1 


Jq,  t  e  C.  .(t)  ”>  h  1  X]  d. 

u  e^i  -  .  j  j 


y- 

Substituting  the  above  relations  into  Eq.  (3.3),  we  arrive  at  Eq.(3.12). 

Lemma  3.1  leads  to  a  bound  on  the  increase  in  the  average 
path  length  and  on  D  (Eq.  (3.10)). 


Proposition  3.2 

Under  the  conditions  above  and  Assumption  3.3,  the  increase  in 
the  average  path  length  in  the  network  due  to  the  reduction  of  routing 
information,  is  such  that: 

m-1  r-  n.n_...n,  -  1  -i 

h, -■■<  £  [i-  *  V-? — J-^ 
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1 


Proof 


Let  denote  the  k  level  cluster  (k  =  0,  1,  . 


. . ,  m)  to 


which  s  belongs.  Then  from  Eq.  (3.11), 


^c  ■  ^  "  N(N  -  1) 


sS  k?i  te^Cs) 

tic^.iCs) 


(3.14) 


The  above  expression  was  obtained  through  the  deconposition  of  the 

computation  of  the  average  over  all  the  nodes  in  the  network.  Then  for 

St 

a  given  node,  s,  the  decomposition  i  done  over  nodes  in  the  same  1 
level  cluster  as  s,  excluding  s  (Cq(s)  =  s) ,  then  over  nodes  in  the  same 
2*^*^  level  cluster  as  s,  excluding  those  in  the  same  1^^  level  cluster 
as  s,  etc. . . 

s  t 

Let  Cj^  j^(j)  be  a  k-1  level  cluster  included  in  Cj^(s);  there 
are  n^^  such  clusters,  then 

"k 

E  -  h  )  ^  s  E  (h't  -  h  ) 

tec.(s)  j=l  tec.  ,(j) 


(3.15) 


Since  Cj^_j(j)  ^  j(s)  =  <(>  and  both  are  included  in  Eq.  (3.3) 

holds  true  for  s  and  any  node  t  in  Cj^  ^(j),  hence 

^  ^st  =  Is  l^j^l  ^se  ^  ^  •'e  t 

eeC^_j(j)  ^  teC^_j(j)  V 

where  e^  satisfies  Eq.  (3.4).  TVo  cases  to  consider  are 


i.  OBR  scheme.  From  Eq.  (3.2j 


I 
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w 


oSc-1^^^  feCj^j(j)  ®0^ 


(3.17) 


Substituting  Eqs.  (3.17)  and  (3.4)  (in  Eq.  (3.4)  t  is  replaced  by  j) 
into  Eq.  (3.16),  we  arrive  at 


^  ^st  “  Ke  “e  C  (i)^ 

teC^.l(j)  ^  ^  ®j^\.lfj^  j  ®jS-l”^ 


(3.18) 


Let  us  define  e^  to  be  the  closest  (inside  Cj^(s))  exchange  node  of 
Aj^_j(j)  to  node  s,  i.e.. 


h\ 


min  {h^^  } 


se  se. 

s  ej£\.i(3)  J 


(3.19) 


Prom  Eq.  (3.18)  and  for  any  exchange  node  e.,  particularly  e  ,  the 

3  s 

relation  below  is  true. 

^  ^  »‘et 

teC^.l(3)  s  teC^.i(j)  s 

Note  that  in  the  equation  above  w  was  replaced  by  its  value  as  defined 
by  Eq.  (3.2) . 

Moreover,  from  Assunption  3.3 


^st  =  ^t  ^  ^ 


(3.21) 


Then  from  the  definition  of  e^. 


h  .  =  h^*  >  h^ 
St  St  —  se. 


V-  t 


(3.22) 


Consequently, 
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t£C|^.,0) 


se 


Substituting  Eq.  (3.23)  into  Eq.  (3.20),  we  arrive  at 

S  <''st  -  ■‘st’  -  £  ''e  t 

Note  that  e^,  t  e  Lemma  3.1, 

k-1 


<  I  d. 


V  t  e 


From  Assunption  3.1 


|Cj^.l(j)l  =  "i"2*‘*"k-l  ^ 

Substituting  Eq.  (3.24  -  3.26)  into  Eq.  (3.15),  we  find 

k-1 

^  (h  ^  Xi* 

tec  (S)  1  4  K  I  3 

Note  that  this  last  equation  is  true  for  any  level  k,  and  for 
s,  hence  by  substituting  it  into  Eq.  (3.14),  we  obtain 

m  k-1 


h  -  h 
c 


-N  -  1  "l”2’''”k-l^"k  ■  ‘^j 


0 


Note  that  for  k  =  1,  5!^  =0. 

j=l 

Interchanging  the  summations  in  the  equation  above,  we  get 

m-1  m 


h,  -h 


\  ***  * 

-N"^  *^3  .  "l"2’“"k-l^"k 

j  =  l  •'  k*j+l 


1) 


(3.23) 


(3.24) 


(3.25) 


(3.26) 


(3.27) 


any  node 
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which  after  some  algebra,  gives  Eq.  (3.13). 

To  end  the  proof  it  is  left  to  show  that  the  Cl-R  scheme  also 
satisfies  Proposition  3.2. 

ii.  CER  Scheme.  From  Eqs.  (3.2)  -  (3.4)  and  (3.19),  we  find 


(  h*^  =  h^  +  h*^ 

)  St  se^  e^t 


+  E  ^  . 

f'i  e  t 
teC:j^_l(j)  s 


(3.28) 


This  equation  is  equivalent  to  Eq.  (3.20)  except  that  the  inequality  is 
tight  here.  TVjo  conclusions:  (1)  the  rest  of  the  proof  can  proceed 
exactly  as  in  (i) ,  (2),  Eq.  (3.28)  conpared  to  Eq.  (3.20),  indicates 
that  the  summation  of  path  lengths  obtained  with  the  OBR  scheme  is 
smaller  than  or  equal  to  the  one  obtained  with  the  CER  scheme.  Hence 
the  average  path  with  an  OBR  is  smaller  than  or  eaual  to  the  average 


path  length  with  a  CER. 

TTie  above  proposition  deals  with  averages;  we  now  intend  to  put 
a  bound  on  the  increase  of  the  path  length  between  an  arbitrary  pair 
of  nodes  s,  t. 


Lemma  3.2 


Under  the  previous  conditions  and  Assuirption  3.3,  and  for  the 


CER  scheme 


1%  -  h  <  £  d. 

St  st  -  j 


¥  s,  t  e  same  k  level  cluster  C. 


¥k  =  1,  2,  ...,  m 


(3.29) 
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I 


I 


Proof 


Under  the  above  condition  Eq.  (3.28)  holds  tnie 


h%  =  h^  +  . 

St  se  e  t 
s  s 


Since  e^  ,  t  e  Cj^  j(j)  ,  then  from  Lemma  3.1 


k-1 

hV  <  L  d. 


Also,  because  of  Assumption  3.3,  and  Eq.  (3.19) 


h  ^  =  h^,.  >  h^ 
St  St  —  se 


Combining  the  last  three  relations  together,  we  find  Eq.  (3.29). 

Note  that  lemma  3.2  is  not  true  with  the  OBR  scheme,  because 
such  a  scheme  tries  to  find  the  best  overall  exchange  node  (e^)  to 
route  the  messages  to,  instead  of  using  the  closest  one. 

We  observed  previously  that  Assumption  3.3  is  a  realistic  one, 
but  if  it  is  not  specifically  built  into  the  clustering  algorithm,  there 
is  no  guarantee  that  the  outcome  of  the  clustering  always  satisfies  that 
assumption.  This  remark  leads  us  to  the  following  proposition. 


Proposition  3. 3 

Under  the  conditions  of  Proposition  3.2  and  with  Assunption  3.3 

removed. 


Proof 


h 

c 


m-1 


h  <  L 
k-1 


(3.30) 


Let  s,  t  be  an  arbitrary  pair  of  nodes  and  let  C.  be  the  lowest 


level  cluster  to  which  s,  t  belong.  TVro  cases  to  consider  are 
i.  k  <  m.  From  Lemma  3.1 

k 


St 


Hence 


IE  d. 
j=l  J 

c  c 

1  ^  -  h  ^  <  h*^^  <  52  d.  <  F  d. 

St  St  —  St  —  1  —  .'H  1 

3=1  ■'  3=1 


m-1 


(3.31) 


ii.  k  =  m.  represents  the  entire  set  of  nodes,  hence 


hE  =  h  ^ 

St  St 


Then  similar  to  the  proof  of  Proposition  3.2,  we  can  show  that  Eq. 
(3.27)  holds  true  for  k  =  m,  i.e., 

m-1 


Z  ChJt  -  S  (n.  -  1) 


teCm(s) 


m 


l"l''2---Vl  ■‘j 

3=1  ■> 


32) 


Because  of  Eq.  (3.31)  and  Assumption  3.1 

X  '"st  -  s  ■'j 


m-1 


(3.33) 


teC 


Substituting  Eqs.  (3.32)  and  (3.33)  into  (3.14)  we  arrive  at  Eq.  (3.30) 


Conclusion 

In  summary,  several  fairly  general  bounds  have  been  derived, 
depending  on  the  assumptions  and/or  the  routing  schemes  selected.  In 
the  next  paragraph  we  will  study  the  behavior  of  some  of  those  bounds 
in  the  context  of  a  defined  class  of  networks.  Let  us  also  show  that 
for  the  degenerate  case  of  1-level  hierarchical  routing  (NCR  scheme) 
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the  bounds  derived  over  the  increase  in  path  length  are  tight;  i.e., 
for  m  3  1,  Eqs.  (3.13)  and  (3.30)  become 


h  -  h  <  0 
c  — 

but  since  h  -  h^0=>h  =h.  Also  Eq.  (3.29)  becomes  h*"  =  h  . 

3. 3  Static  Performance  Evaluation  of  the  ftlR  Schemes  for  a  Family 
of  Networks 

If  temporarily  we  do  not  take  into  account  the  significant 
gains  obtained  in  reducing  the  CPU,  storage  and  line  utilizations 
required  by  the  routing  procedures,  then  the  application  of  the  ftlR 
schemes  will  result  in  a  degradation  of  the  performance  of  the  network, 
as  compared  to  the  utilization  of  a  non-clustered  scheme.  Ihe  loss  in 
performance  (delay,  throughput)  is  closely  related  to  the  average  path 
length  a  message  follows  in  the  network.  The  evaluation  of  the  increase 
in  path  length  provides  us  with  a  first  cut  modeling  of  the  loss  in 
network  performance.  Moreover,  the  study  of  the  bounds,  derived  pre¬ 
viously,  represents  a  worst  case  evaluation  of  the  mR  schemes.  Since 
the  evaluation  is  in  terms  of  path  length,  we  will  refer  to  it  as 
static  performance  evaluation.  The  gains  obtained  are  modeled  by  the 
single  variable  A/N  which  represents  the  reduction  of  routing  infor¬ 
mation  (we  will  refer  to  A/N  as  the  relative  table  length).  The  static 
performance  evaluation  is  performed  over  a  class  of  con^)uter  networks. 

3.3.1  A  Family  of  Large  Distributed  Networks 


upon  which  it  is  possible  to  fit  an  m- level  hierarchical  clustering 
whose  outcome  satisfies  Assunptions  3.1  -  3.3.  Also  the  resulting 
cluster  subnets  at  any  level  are  of  diameters  bounded  by  a  power  law 


function  of  the  number  of  nodes  in  that  cluster;  i.e.,  if  n  is  the  size 
of  a  cluster  and  d  the  diameter  of  that  cluster's  subnet  then 


d  <  bn  +  c 


(3.34) 


where  b,  c,  v  are  positive  parameters  and  0  £  v  £  1  (see  below). 

If  N  is  the  size  of  such  a  network,  then  the  average  path 
length  (hop  distance)  of  that  network,  h,  must  be  a  power  law  function 
of  N, 


h  =  aN 


(3.35) 


where  a  is  a  positive  parameter. 

Grid  type  networks  [BARA  64],  hexagonal  networks,  etc.,  fall 
into  that  category  when  the  MHC  results  in  subnetworks  of  similar  struc¬ 
ture  as  the  original  and  when  the  path  lengths  are  expressed  in  hops. 
Expressions  for  the  average  path  length  (with  a  uniform  traffic  matrix 
=  Y)  and  for  the  diameter  of  the  grid  and  the  torus  networks  have 
been  derived  in  Appendix  A.  Some  of  the  results  obtained  are: 


Square  grid  of  size  N 


Square  torus  of  size  N, 


(  d  =  2  /n"  -  2 

.  /a 


(with  y/n  odd)  '  d  =  v/lT  -  1 


(3.36) 


(3.37) 
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Furthermore,  if  the  partitioning  of  either  the  square  grid  or 
torus  networks  results  in  grid  cluster  subnets  at  all  levels  (see  Fig. 
5.6),  then  for  any  cluster  subnet  of  size  n  its  diameter  d  is  such  that 

d  <  2/n  -  2  (3.38) 

As  a  consequence  the  grid  and  torus  networks  fit  the  above  descriptions. 
Note  also  that  for  those  networks  the  exponent  v  (Eqs.  (3.34)  and 
(3.35))  is  equal  to  1/2. 

In  general,  the  exponent  v  reflects  the  connectivity  of  the 
network  considered.  For  very  highly  connected  networks  v  is  in  the 
neighborhood  of  zero;  e.g.,  for  a  fully  connected  network  v  =  0 
(h  =  1,  d  =  1).  Whereas  for  veiy  low  connected  networks  v  is  in  the 
neighborhood  of  one;  e.g.,  for  loop  or  chain  type  networks,  v  =  1. 

Computer  communication  networks  fall  into  the  class  of  dis¬ 
tributed  networks.  This  class  includes  networks  such  as  the  ARPANET, 
the  NPL  network  [DAVI  73],  the  Cyclades  network  [ZIMM  75],  TELENET, 
etc.  The  main  characteristic  of  those  distributed  networks  is  their 
low  connectivity.  In  general  a  connectivity  2  (or  3)  is  imposed  on 
their  design.  For  large  distributed  networks  a  connectivity  of  3  to  4 
seems  more  appropriate  [  NAC  73].  The  torus  networks  considered  above 
are  of  connectivity  4  and  with  an  exponent  v  =  1/2,  hence  they  appear 
to  be  good  representatives  of  large  distributed  networks.  Moreover, 
their  topological  structures  lead  to  simple  partitions  such  as  square 
subgrid  clusters.  In  the  sequel,  we  will  first  derive  a  limiting 
result  valid  for  the  entire  class  of  networks,  then  we  will  restrict 
our  considerations  to  values  of  a,  b,  c,  v  as  obtained  for  a  torus. 
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3.3.2  Limiting  Performance  of  the  KHR  Schemes 

As  mentioned  earlier,  the  static  performance  evaluation  of  the 

I 

^t^R  schemes  involves  the  determination  of  a  bound  on  the  relative 
increase  in  the  average  path  length.  For  the  above  family  of  networks, 
the  size  of  a  level  cluster  is  njn2...nj^  and  because  of  Eq.  (3.34), 

dj^  =  b(njn2. .  .nj^)^  +  c  k  =  1,  2,  ...»  m  (3.39) 


This  family  of  networks  also  satisfies  Assumptions  3.1  -  3.3,  hence 
Proposition  3.2  holds  true.  From  Eqs.  (3.10),  (3.13)  and  (3.39) 

V 


At  A 

c  A 

D  =  ^-  1  <E  = 


a(N  -  1)N  k 


?!  ■  A  "‘)('’(A "‘)  *  j 


(3.40) 


Notice  that  E  is  defined  as  the  bound  on  D.  It  is  the  behavior  of 
E  versus  l/H  that  we  are  interested  in.  TVto  cases  will  be  distinguished 
in  the  study  of  E;  first  when  the  MHC  results  in  a  minimum  table  length, 
Eq.  (2.6),  second  when  the  mc  results  in  a  vector  minimizing  the  bound 
E  for  a  given  table  length  £. 

For  an  optimal  clustering  structure  we  know  from  Proposition 
2.1  that  the  degree  vector  must  satisfy  Eq.  (2.5).  Substituting  Eq. 
(2.5)  into  Eq.  (3.40),  we  arrive  at 


E  =  - - - -  Fn  -  +  cl 

a(N  -  1)N  k=l 


and  if  we  restrict  v  to  be  different  from  zero,  then  after  some 
algebra 


Notice  again  that  for  m  =  1,  E  =  0. 

Also  from  Eq.  (2.6),  the  relative  table  length  is 

i  _ 

N  ■  N 


(3.42) 


The  above  considerations  lead  to  the  general  limiting  result  below. 


Proposition  5.4;  Limiting  Performance 

Consider  the  above  family  of  networks  and  the  above  NWR  schemes 
(OBR,  CER)  with  a  fixed  number  of  levels  m  and  an  optimal  clustering 
structure.  Then,  as  N,  the  number  of  nodes,  goes  to  infinity,  the 
"static"  performance  of  the  WR  schemes  approaches  that  of  a  non¬ 
clustered  routing  scheme,  while  the  relative  table  length  approaches 
zero;  i.e.. 


(3.43) 


Thus  we  claim  we  can  get  the  best  of  both  possible  worlds. 


Proof: 

From  Eq.  (3.42),  the  above  limit  of  VN  is  obvious  (recall 


that  m  is  fixed).  Also  from  Eq.  (5.3),  it  is  obvious  that 


hence 


h  >  h  >  0 
c  —  — 


Then  because  of  Eq.  (3.40) 


-r* 


0  1  iP  '  11^ 


(3.44) 


As  a  consequence,  it  is  enough  to  prove  that  the  limit  of  E  is  zero. 
Expanding  Eq.  (3.41)  around  N  we  find 


E  .  i.  N-*/”  . 

a 


(3.45) 


hence 


limit  E  =  0 
N-Ko 


Q.E.D. 


Notice  that  the  closer  v  is  to  one  (v  /  0) ,  the  faster  is  the 
convergence  of  E  to  zero.  In  other  words,  as  could  be  expected,  the 
more  distributed  (the  less  connected)  the  networks  are  the  better  the 
WlR's  perform.  Notice  also  the  enormous  gains  in  table  length  for  a 
relatively  insignificant  increase  in  the  average  path  length. 

The  above  results  hold  true  if  we  relax  Assuii5)tion  3.3  imposed 
over  the  outcome  of  the  clustering  of  any  of  the  networks  which  belong 
to  the  family  considered  here.  To  prove  this  statement  let  F  be  the 
bound  on  the  relative  increase  in  path,  as  obtained  from  Proposition 
3.3,  i.e.,  from  Eq.  (3.30)  and  (3.39), 


h  ,  ,  m-1 

IT- 

"  aN^  k=l 


[b(njn2. .  .nj^)^  +  c] 


(3.46) 


For  an  optimal  clustering,  the  above  equation  becomes. 
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t;-' 


I 


F  = 


aN 


N  -  N 


,v/m 


n''/".  1 


+  c(ni  -  1) 


(3.47) 


Similar  to  E,  as  N  goes  to  infinity,  F  goes  to  zero,  which  implies  that 
h^/h  goes  to  1. 

The  result  of  Proposition  3.4  was  derived  for  a  fixed  m;  let 
us  now  examine  the  situation  where  m  is  variable.  Of  interest  is  the 
value  of  m  which  corresponds  to  the  global  optimum  clustering  structure. 
That  value  is,  from  Eq.  (2.17),  m,^  =  inN. 

Substituting  Eq.  (2.17)  into  Eq.  (3.41),  we  arrive  at 


E*  =  E 


a(N  -  1)N’ 


bN  - - —  +  c(ln  N 

e'^-  1 


-  1)N 


.,1+v  1+v  ,, 

,  N  -  e  N  -  e 

-  b  — 5— - c 


Then 


1+v 


limit  E*  =  - 
N-w  ® 


e  -  1 


e^  -  1 


1+v  , 

e  -  1 


(3.48) 


(3.49) 


As  a  consequence  the  result  of  Proposition  3.4  is  not  necessarily  true 
anymore.  If  we  consider  grid  or  torus  networks,  then  from  Eqs.  (3.36)- 
(3.38), 


2  1 
for  a  grid:  a=y,b=2,  c=-2,  v=j 

for  a  torus:  a=j,b=2,  c=-2,  v=j 


Substituting  the  above  values  into  Eq.  (3.49),  we  arrive  at 


(3.50) 
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I  5.01  for  torus  nets 

limit  li*  =  [  (3.51) 

f  3.V6  for  grid  nets 

Fig.  3.4  illustrates  the  behavior  of  E*  (for  torus)  with  respect  to  N. 

The  curve  shows  E*  as  an  increasing  function  of  N.  It  also  shows  that 
the  cost  of  operating  at  the  (global)  minimum  table  length  may  become 
quite  high  (up  to  6  times  increase  in  path  length) .  Fortunately,  as 
noticed  in  Chapter  2,  most  of  the  table  reduction,  for  practical  pur¬ 
poses,  may  be  obtained  with  m  quite  a  bit  smaller  than  the  global 
number  of  levels  m*,  and  as  we  will  see  in  later  plots  (Figs.  (3.5)  and 
(3.7))  the  cost  at  a  small  m  is  quite  minimal.  Those  figures  also  show 
the  behavior  of  E,^  versus  H/N. 

3.3.3  Static  Performance  Evaluation  for  the  ^t^R's;  Numerical 
^plications 

In  the  previous  section  we  observed  that  at  the  limit  (N  -►  «>) 
considerable  table  reduction  can  be  achieved  with  no  loss  in  performance. 
In  this  paragraph,  we  intend  to  look  at  the  more  general  case  of  a  finite 
N.  The  purpose  is  to  correlate  the  degradation  in  performance  with  the 
table  reduction.  More  precisely,  we  will  evaluate  a  maximum  performance 
degradation  in  terms  of  the  gains  in  table  length.  Also  this  evaluation 
will  be  carried  out  with  an  NWC  which  results  first  in  a  minimal  table 
length  and  second  in  a  minimal  bound  E. 

The  numerical  study  below  is  restricted  to  values  of  a,  b,  c,  v, 
as  •■ined  for  torus  networks  (Eg.  (3.50)).  although  such  a  study 
could  easily  be  repeated  for  other  networks  which  belong  to  the  family 


considered  here. 


3. 3. 3.1  Evaluation  with  an  MIC  Resulting  in  a  Minimal  Table  Length 
The  expression  cf  E  has  already  been  derived,  in  Eq.  (3.41), 
for  an  MIC  which  results  in  a  minimal  table  length  £,  Eq.  (2.6).  If 
we  consider  torus  networks,  then  from  Eq.  (3.50)  E  becomes 


E 


I 

N 


2 

(N  - 


N 


1/2 


N 


,l/2m 


l/2m 
-  1 


N3/"2m  _ 


N  - 

N^/"'  -  1 


mN 


1/m 


N 


(3.52) 


The  above  equations  provide  a  parametric  representation  of  E  as  a  func¬ 
tion  of  J./N.  m  acts  as  the  coupling  variable  in  that  representation. 

By  letting  m  vary  from  1  to  InN  we  obtain  all  the  possible  values  of 
VN;  and  subsequently  for  each  value  of  A/N  we  obtain  the  corresponding 
value  of  E.  The  above  range  of  m  is  chosen  in  accordance  with  the 
results  obtained  in  Chapter  2  (refer  to  Proposition  2.2  and  Fig.  2.5); 
and  also  in  accordance  with  the  fact  that  E  is  an  increasing  function 
of  m  (this  fact  is  obvious  from  the  proof  of  Proposition  3.2). 

Numerical  results  are  presented  in  a  set  of  figures  as  follows: 
Fig.  3.5  illustrates  the  behavior  of  E  with  respect  to  A/N  and  for 
several  values  of  N.  We  observe  that  an  original  substantial  table 
reduction  can  be  achieved  for  small  values  of  E,  i.e.,  for  a  small  drop 
in  performance.  However  if  we  try  to  reduce  A/N  to  values  close  to  its 
global  minimum,  Eq.  (2.19),  then  E  increases  sharply.  Fig.  3.5  also 
illustrates  the  limiting  behavior  of  the  MIR  schemes  (see  Proposition 
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3.4)  whereby  as  N  becomes  larger,  more  gains  can  be  obtained  for  a 
lesser  loss  in  performance.  This  property  is  shown  by  the  fact  that 
the  curves  E  versus  i/ti  stay  flat  on  the  A/N  axis  for  larger  intervals. 
The  linear  scale.  Fig.  3.6,  provides  a  better  illustration  of  the  above 
phenomenon . 

Figs.  3.7  -  3.8  show  the  behavior  of  1/(1  +  E)  with  respect  to 
A/N.  The  function  1/(1  +  E)  represents  a  lower  bound  over  h/h^  (see 
Eq.  (3.44)).  These  figures  exhibit  properties  similar  to  the  previous 
ones. 

Finally,  Fig.  3.9  shows  how  nuch  table  reduction  can  be  ob¬ 
tained  for  a  given  "tolerance"  E  as  a  function  of  the  size  N.  The 
concentration  of  the  curves  for  1  ^  E  ^  5  (Note  from  Eq.  (3.51)  that 
E  s  5  is  the  maximun  error  for  torus  networks.)  again  shows  that  beyond 
a  certain  point  the  gains  in  table  length  can  only  be  achieved  at  the 
expense  of  large  losses  (large  E).  However  in  the  range  0  to  1  for  E 
considerable  gains  can  yet  be  obtained.  For  that  range  of  E  as  shown  in 
Fig.  3.10,  the  corresponding  range  of  the  number  of  levels  m  is  limited 
to  fairly  small  values.  The  curves  in  Fig.  3. 10  represent  the  values  of 
m  which  are  computed  from  Eq.  (3.52)  for  a  given  E  and  N,  and  which 
served  to  find  the  value  of  A/N  for  a  given  tolerance  E  (see  Fig.  3.9). 
The  fact  that  m  is  considered  to  be  a  real  variable  will  be  inteipreted 
in  the  next  section. 

Moreover,  in  Section  2. 3. 2.4,  we  noticed  that  most  of  the 
table  reduction  is  obtained  for  small  values  of  m.  We  conclude  that 
the  MHR  schemes  operating  with  a  small  nun^er  of  levels  2  £  m  £  4 
yield  substantial  table  reduction  for  a  relatively  small  increase  in 
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path  length. 


The  above  study  considered  MiR  schemes  (OBR,  CER)  based  on 
clustering  structures  which  yield  minimal  table  length.  The  question 
arises  as  to  the  existence  of  a  clustering  structure  which  minimizes 
the  increase  in  path  length. 


3. 3. 3. 2  Minimal  Botaids 

In  this  section  we  address  the  problem  of  choosing  a  cluster¬ 
ing  structure  (i.e.,  n,  m)  which  minimizes  the  increase  in  path  length 
for  a  given  H  and  N.  Since  we  have  no  analytic  expression  for  the 
increase  in  path  length,  we  will  limit  our  considerations  to  minimizing 
the  bound  E  (or  F) .  In  order  to  simplify  the  problem  we  will  also 
assume  a  partitioning  of  equal  size  clusters  at  any  level.  Thus  for  a 
fixed  m,  the  optimization  problem  is 


/Given: 

N, 

A,  m  (integer) 

1  min: 

E 

(see  Eqs.  (3.40)and  (3.50)) 

1  over: 

n 

real -valued  vector 

y  s.t. : 

m 

E 

k-l 

<l 

(3.53) 


n  "k  •  N 

k-l 


For  the  abo  problem  to  be  feasible,  the  given  table  length  must 
satisfy 

I  >  (3.54) 
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which  is  a  straightforward  consequence  of  Proposition  2.1.  In  what 

follows  we  assuaw  that  i,  always  satisfies  Eq.  (3.54). 

Problem  3.53  can  be  easily  solved  numerically  for  m  -  2  and 

m  >  3;  beyond  that  it  becomes  quite  complicated  and  no  effort  has  been 

ejqpended  in  that  direction.  However,  as  noticed  earlier,  m  »  2,  3  will 

c^>ture  most  of  the  realistic  table  reductions  for  a  fairly  large  range 
6  7 

of  N  (N  up  to  10  ,  10  ) .  IVfo  cases  to  consider  are 
m  a  2 

Ihe  objective  function,  Eq.  (3.40)  and  Eq.  (3.50),  is 

^2  *  - i-jTx  (N  -  n  )  (n^^  -  1)  (3.55) 

Since  the  bound  E  is  a  decreasing  function  of  IL  (see  Fig.  3.5)  then  at 
optimality  the  constraint  n^  ♦  n^  £  &  is  tight.  Hence  n^  n^  are  such 
that 

I  "l”2  ■  " 

whose  solution  is  the  pair 

(3.56) 

Finally,  the  optimal  solution  can  be  numerically  obtained  by  choosing 
the  pair  n^,  n^  which  minimizes  E2  and  satisfies  Eq.  (3.56). 

Fig.  3.11  shows  the  plots  of  the  minimum  E^  (denoted  by  E^  on 
the  graph)  with  respect  to  l/H  and  for  a  set  of  values  of  N.  Also 
represented,  is  the  botnd  E,  Eq.  (3.52),  obtained  with  a  vector  n  which 
satisfies  Proposition  2.1  (i.e.,  n^^  »  k  »  1,  ...,  m).  The  main 
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observation  is  that  E  is,  in  general,  a  slightly  larger  bound  than 

For  large  values  of  A/N  or  N,  E2  and  E  are  very  small  and  close  to  each 

-1/2 

other.  They  are  equal  at  A/N  =  2N  '  which  corresponds  to  the  mini¬ 
mum  table  length  at  m  »  2  (n^  =  n2  =  n/n)  . 

m  =  3 

Similar  to  the  above,  at  optimality  the  first  constraint  is 
tight,  hence 


"1  ^  ”2  *  ”3  “  ^ 


(3.57) 


Then  for  any  feasible  vector  n  and  for  a  given  n^  >  0,  the  pair  n^,  n2 
(or  n2,  nj^)  must  be  eqtial  to 


1. 

2 


J 


(A 


■n,)- 


(3.58) 


Also  the  expression  of  the  bound  for  a  feasible  N  is  (Eqs.  (3.40), 


(3.50)  and  (3.57)), 


"3  = 


(N  -  1)N 


172 


(N 


nj)(n}/2-  1) 


(3.59) 

Note  that  in  Eq.  (3.59),  n2  was  replaced  by  N/n^n^.  Then  for  a  given 
n^,  the  minimum  of  Ej  can  be  found  by  evaluating  Ej  for  the  two  possible 
values  of  n^,  Eq.  (3.58).  Afterward  we  minimize  the  result  over  n^. 

The  results  of  this  entirely  numerical  procedure  are  shown  in  Fig.  3.12, 
(where  the  minimum  is  denoted  by  Ej)  Fig.  3.12  also  shows  the  bound  E 
Eq.  (3.52)  for  a  continuous  m  (1  <  m  <  3).  The  curves  for  the  minimum 
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BOUNDS  ON  RELATIVE  INCREASE  IN  PATH  LENGTH 


Figure  3.12.  Reletive  Boundt  on  the  Increase  in  Path  Length  with  a  Continuous  and  a 
Discrete  m,  1  <  m  <  3. 
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are  also  shown  here. 

Notice  that  E  =  Ej  for  £/N  »  — —  which  corresponds  to  the 
minimum  table  length  for  a  3-level  hierarchy.  Similar  observations  as 
above  apply  here.  In  general  for  a  given  l/H,  E  is  slightly  worse  than 
E2  and  E^,  except  for  large  values  of  H/N  (i.e.,  small  table  reduction) 
where  all  the  curves  are  quite  close  to  each  other,  with  sometimes  E 
even  a  bit  better.  Also,  the  lower  envelope  of  and  E^  corresponds  to 
the  overall  minimum  of  Problem  3.53  where  m  is  restricted  to  either 
2  or  3. 

Ihe  above  conparison  of  E  to  E^  and  E^  leads  us  to  the  follow¬ 
ing  in^ortant  remark  as  to  the  utilization  of  a  continuous  number  of 
levels  m. 

Remark;  Continuotis  m 

Ihe  fact  that,  in  general,  E  is  slightly  worse  than  E^  or  E^ 
will  allow  us  to  always  use  an  MIC  idiich  results  in  a  minimal  table 
length  (i  =  nW^^"*)  and  whose  number  of  levels  is  continuous,  and  still 
evaluate  a  worst  case  performance  of  the  MIR  schemes.  In  other  words, 
in  the  sequel  the  MiR  schemes  will  always  be  characterized  by  E  and  £. 
as  given  in  Eq.  (3.52). 

A  non-integer  value  of  m  may  be  interpreted  as  explained  in 
the  following  exanple.  Assume  that  we  need  to  operate  a  network  of 
size  N  with  an  MIR  but  with  an  allowed  maximum  increase  in  path  length 
equal  to  E^.  For  those  values  of  E^  and  N  we  can  find  the  minimum 

table  length  (as  in  Fig.  3.9)  and  the  corresponding  number  of  levels 
l/mQ 

”*0*  ^0  *  "'0^  ■  assume  that  =  1.5;  then  we  can  choose  a 
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number  of  levels  m  =  2  and  a  vector  n  which  minimizes  (Problem  3.S 
with  m  =  2  and  i  =  Aq) .  The  minimum  value  of  is,  in  general, 
slightly  better  than  E^;  hence  the  new  mC  with  a  discrete  m  satisfies 
the  maximum  tolerance  allowed. 

The  above  remark  is  only  true  for  1  £  m  <  3  but  as  noticed 
previously,  that  range  of  m  yields  most  of  the  table  reduction  for 
practical  purposes.  Also,  from  the  shape  of  the  curves  in  Fig.  3.12, 
it  appears  that  these  results  could  be  extended  to  hi^er  values  of  m. 

3.4  Conclusion 

In  this  chapter  we  examined  the  effect  of  hierarchical  routing 
on  network  path  length.  Bounds  were  derived  to  evaluate  a  maximum 
increase  in  path  length  for  a  givm  table  reduction.  Furthermore,  the 
bounds  demonstrate  that  no  significant  increase  in  path  length  need  be 
incurred  in  the  limit  of  a  very  large  network. 

Ihe  reduction  in  table  length  means  that  more  channel  capacity 
and  storage  are  available  for  the  transmission  of  data  traffic  in  the 
network.  However,  those  gains  were  obtained  at  the  expense  of  longer 
paths  in  the  network.  It  is  then  natviral  to  evaluate  the  performance 
of  the  hierarchical  routing  in  terms  of  delay  and  throughput,  and  to 
define  the  region  of  N  where  clustering  becomes  economical.  Ihese 
questions  are  the  object  of  the  next  chapter. 
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CHAPTER  4 


STOCHASTIC  PERFORMANCE  EVALUATION 
OF  THE  HIERARCHICAL  ROUTING 


The  previous  chapters  (2,  3)  demonstrated  that  enormous  table 
reduction  can  be  achieved  through  hierarchical  routing.  A  shortcoming 
of  those  gains  was  found  in  the  increase  of  the  network  path  length. 
This  means  that  the  nodal  storage  and  channel  capacity  recovered  as  a 
result  of  the  table  reduction  may  have  to  be  partially  or  completely 
(or  more)  paid  back  to  handle  the  excess  traffic  caused  by  longer  net¬ 
work  paths.  In  this  chapter  we  are  interested  in  the  trading  relations 
among  the  table  reduction,  the  nodal  storage,  the  channel  capacity,  the 
network  size,  the  throughput  and  the  delay.  Several  queueing  models 
are  developed  to  capture  and  exhibit  the  interrelationships  among  these 
variables.  The  models  demonstrate  that  for  some  reasonable  cost  and 
performance  constraints  and  for  a  class  of  symmetrical  and  distributed 
networks,  the  non-hierarchical  routing  becomes  infeasible  for  N  (net¬ 
work  size)  beyond  some  "critical"  value;  whereas,  on  the  other  hand, 
they  show  that  hierarchical  routing  operating  with  an  appropriate  table 
length  is  capable  of  maintaining  a  fairly  good  network  performance  for 
fairly  large  values  of  N. 

In  what  follows,  first  we  present  the  analysis  of  network  per¬ 
formance  under  fixed  routing,  and  a  fundamental  assumption  upon  which 
we  will  build  the  models  mentioned  above.  This  assumption  typically 
results  in  a  mapping  of  hierarchical  and  non-hierarchical  adaptive 
routing  into  deterministic  (fixed)  routing.  Then  four  models  will  be 


gradually  developed  to  deal  with  different  subsets  of  the  variables 
above;  listed  by  order  of  coii5)lexity,  they  are: 

Model  with  no  updates  and  no  storage  limitation 
Model  with  iq)dates  and  no  storage  limitation 
Model  with  no  iipdates  and  with  storage  limitation 
Model  with  updates  and  with  storage  limitation 

4. 1  General  Considerations 

The  objective  here  is  to  first  introduce  the  original  model 
developed  by  Kleinrock  [KLEI  64]  for  delay  analysis  in  store-and- forward 
(S/F)  computer  networks.  Then  we  introduce  a  class  of  symmetrical  net¬ 
works  and  a  main  assumption  tdiich  will  serve  as  a  framework  for  the 
four  models  to  be  developed  in  this  chapter. 

4.1.1  Delay  Analysis  in  S/F  Computer  Networks:  Kleinrock*s  Model 
As  described  in  Section  2.1,  in  a  S/F  net,  a  message  enters 

the  net  at  the  source  node,  visits  several  nodes  along  the  communication 
path,  and  leaves  the  net  at  the  destination  node.  The  time  a  message 
spends  at  each  node  is  the  sum  of  nodal  processing  time  (for  routing 
purposes),  of  queueing  time  due  to  interfering  messages,  and  the  trans¬ 
mission  time.  A  very  important  performance  measure  of  an  S/F  net  is 
the  total  average  delay,  T,  a  message  spends  in  the  network.  T  is 


then  defined  as 


where:  *  average  message  rate  from  source  i  to  destination  j 

z.j  *  average  message  delay  from  i  to  j 
Finally  T  is  the  total  input  rate  (throughput). 


i,jeS  ■> 


(4.2) 


A  straightfoiward  application  of  Little's  result  leads  to  the 
e]q>ression  of  T  in  terms  of  the  individual  channel  delays  [KLEI  64], 


NA  \ 

T  =  L  -r  h 

1*1 


(4.3) 


where:  *  average  traffic  rate  on  channel  i 

t^  *  average  nodal  processing  plus  queueing  plus 
transmission  on  the  i^^  channel. 

Unfortunately,  we  are  not  able  in  general  to  evaluate  t^^  and 
However,  if  we  make  the  following  assumptions  [KLEI  64]; 

a.  external  Poisson  arrivals 

b.  exponential  message  length  distribution  (identical  for 
all  messages) 

c.  single  packet  messages 

d.  error  free  channels 

e.  no  nodal  delay 

f.  independence  assumption 

g.  deterministic  routing 

h.  infinite  nodal  storage 

With  such  assumptions  (to  be  discussed  later),  the  S/F  net 
can  be  modelled  as  a  network  of  queues  of  the  Jackson  type  [JACK  57]. 
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In  particular  each  queue  behaves  as  an  independent  m|m|1  queue  and  the 


average  delay  on  channel  i,  t^,  is  given  by 


where:  »  average  message  length  [kbits/messg] 

*  capacity*  of  channel  i  [kbits/sec] 


(4.4) 


Ihe  average  rates  i  =  1,  ....  NA,  can  be  easily  confuted,  given 

the  underlying  deterministic  routing. 

The  substitution  of  Eq.  (4.4)  into  Eq.  (4.3)  gives 


,  NA 

T-f  E 


X. 

1 


(4.5) 


uC.  -  X. 

1=1  ^1  1 

A  sinple  relationship  exists  between  the  total  internal  traf¬ 
fic  A,  the  total  external  traffic  F,  and  the  average  weighted  (with 
traffic)  path  length  n. 


n  =  A/r 

(4.6) 

NA 

where  ^  ~  ^ 

(4.7) 

i=l  ^ 

Moreover  in  a  0-Ioad  condition,  i.e.,  -*■  0 

¥  j ,  k  ,  the  total 

average  delay  becomes. 

NA  X./A 

(4.8) 

where  X^/A  is  a  constant  which  depends  only  on  the  routing  scheme. 


* 

Not  to  be  confused  with  the  notation  used  earlier  for  cluster. 


Discussion  of  the  Above  Assumptions 


Discussions  of  the  above  assumptions  can  be  found  in  [KLEI  64], 
[FULT  72],  [GERL  73A],  along  with  some  further  extensions  of  the  above 
model.  In  particular,  we  will  discuss  the  last  three  assumptions  in 
order  to  motivate  some  of  the  models  to  be  considered  in  this  chapter. 

Independence  Assumption;  [KLEI  64],  [FULT  72] 

Ihis  assumption  is  quite  acceptable  as  long  as  the  network 
does  not  contain  long  chains  with  no  interfering  traffic. 

Deterministic  Routing  Assumption; 

As  mentioned  in  Section  2.1.1,  typically,  a  network  is  designed 
using  deterministic  routing  [GERL  73A]  and  then  operated  using  adaptive 
routing.  Consequently  the  validity  of  this  assumption  depends  criti¬ 
cally  on  the  difference  of  behavior  between  deterministic  and  adaptive 
techniques.  A  thorough  investigation  on  this  subject  can  be  found  in 
[FULT  72].  Fultz  finds  a  close  agreement  between  the  two  techniques. 

He  showed  in  a  19-node  net  application  that  the  difference  in  delays 
obtained  with  a  well  chosen  adaptive  technique  and  with  a  near- optimal 
deterministic  technique  is  less  than  5  to  10%  almost  until  saturation. 
The  "adaptive"  delay  tends  to  be  hif^er,  mainly  because  of  the  line 
overhead  utilized  by  the  update  traffic. 

Fultz's  study  is,  of  course,  dependent  on  the  particular 
adaptive  policy  and  network  considered.  However,  it  demonstrates  that 
adequate  adaptive  policies  can  be  devised  to  achieve  a  performance  very 
close  to  that  of  a  near-optimal  deterministic  policy. 

Further  refinement  of  the  above  assumption  can  be  realized  by 
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including  the  line  overhead  due  to  the  update  traffic.  This  consider¬ 
ation  will  become  crucial  when  dealing  with  large  nets,  as  we  will  see 
later. 


Infinite  Storage  Assumption 

Nodal  storage  limitation  can  cause  blocking  and,  consequently, 
retransmission  (ARPANET)  or  loss  of  messages,  hence  an  increase  in  delay 
or  a  decrease  in  throughput.  Moreover,  it  can  lead  to  serious  network 
degradation  and  even  deadlocks  [FULT  72],  [KAHN  71],  [KLEI  74]. 

Zeigler  [ZEIG  71]  investigated  some  theoretical  aspects  related 
to  the  dynamics  of  cliques  of  blocked  nodes  in  a  symmetrical  network 
environment. 

It  is  generally  accepted  [FRAN  70],  [COLE  71],  that  with  rea¬ 
sonable  storage  size,  the  above  assumption  is  fairly  accurate.  However 
this  assumption  becomes  critical  if  the  nodal  storage  becomes  small. 

This  situation  is  very  likely  to  occur  in  a  large  network  environment 
if  the  routing  tables  are  not  reduced  to  a  reasonable  length.  A  model 
is  presented  in  Section  4.4  to  precisely  deal  with  this  question. 


4.1.2  A  Class  of  Symmetrical  Networks 


The  delay  analysis  performed  in  the  above  model  relies  on  the 
knowledge  of  the  input  rates  (A^'s)  to  the  individual  channels  in  the 
network.  As  mentioned  earlier,  the  X^'s  can  be  determined  once  we  know 
the  deterministic  routing  policy  and  the  traffic  requirement.  Moreover, 
if  we  know  the  channel  capacities  then  the  X^'s  can  be  computed  to  lead 
to  a  minimal  delay  T  [FRAT  73] . 
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A  shortcoming  of  the  above  numerical  procedures  is  that,  in 
general,  they  hide  the  interrelationships  existing  among  the  different 
design  variables  (traffic  requirement,  channel  capacities,  network 
topology  and  average  delay) .  Fortunately,  for  some  symmetrical  networks 
(see  below)  a  simple  analytical  relationship  exists  among  the  above 
variables. 

The  class  of  nets  to  be  considered  in  this  chapter  is  composed 
of  all  the  nets  which  belong  to  the  family  of  nets  presented  in  Section 
3.3.1,  and  which  also  satisfy  the  following  properties. 

i.  All  nodes  are  equivalent  with  respect  to  the  topology 
of  the  network.  Hence  they  are  of  equal  degree,  R. 

ii.  All  channels  are  of  equal  capacity  C. 

iii.  All  external  input  traffic  rates  are  equal,  i.e., 

Yj,  =  Y^j,ksS  (4.9) 

AS  an  example,  torus  nets  (see  Appendix  A)  fall  into  this  category. 

The  above  properties  lead  to  a  simple  delay  expression. 

4.1.2. 1  Average  Delay 

For  this  class  of  nets,  the  following  relations  exist: 

Number  of  (simplex)  channels:  NA  =  R  •  N  (4.10) 

Total  external  traffic:  T  *  N(N  -  1)y  (4.11) 

Furthermore,  it  is  obvious  that  with  this  particular  topologi¬ 
cal  structure,  capacity  assignment  and  traffic  requirement,  the  optimal 
flow  assignment  [KLEI  64],  [GERL  73A]  is  a  shortest  path  routing.  The 
selection  of  the  particular  shortest  paths  (in  case  that  more  than  one 
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exists)  must  result  in  perfectly  balanced  flows,  i.e.. 


=  X  i  =  1,  NA  (4.12) 

Consequently  the  network  path  length  n  becomes  the  average  shortest 
path  length  h,  defined  in  Appendix  A,  Eq.  (A.2),  then  from  Eq.  (4.6) 

^  =  h  (4.13) 

Also  the  total  internal  traffic  A  (Eq.  (4.7)),  becomes 

A  =  NA  X  (4.14) 


Combining  the  two  equations  above,  we  arrive  at 


(4.15) 


If  we  let  t  denote  the  average  delay  on  any  channel,  then  the  total 
average  delay  becomes,  Eq.  (4.3), 


T  =  ht 


(4.16) 


Moreover,  as  a  consequence  of  Eqs.  (4.4),  (4.5),  (4.12)  and  (4.15) 


and 


(4.17) 


(4.18) 


This  is  the  result  we  were  aiming  at,  which  simply  relates  the  delay  T, 
the  traffic  T,  the  channel  capacity  C  and  the  network  path  length  h. 

Eq.  (4.18)  shows  that  the  net  is  equivalent  (with  respect  to 
delay)  to  a  single  m|m|1  queue  with  an  input  rate  of  F/NA  and  a  service 


rate  yC/h.  This  observation  leads  to  the  definition  of  the  network 
utilization 


^  *  yC  NA 


(4.19) 


Ihe  above  definition  will  be  used  in  the  rest  of  the  chapter,  mainly 
for  normalization  purposes. 


4. 1.2.2  A  Scaling  Scheme 

Since  the  main  objective  here  is  to  study  routing  in  large 
nets,  it  is  necessary  to  specify  the  structure  of  those  large  nets  with 
respect  to  the  size  N,  in  some  continuous  way.  Any  such  specification 
will  be  referred  to  as  a  scaling  scheme  (strategy). 

As  the  network  grows,  a  main  objective  of  a  scaling  strategy 
could  be  to  maintain  the  same  average  delay  T  for  a  reasonable  increase 
of  the  total  traffic  F  and  of  the  network  cost  (channel  capacity  cost) . 
The  total  traffic  F  may  be  reasonably  assumed  to  increase  linearly  with 
the  number  of  nodes.  Which,  due  to  the  uniform  traffic  condition 
(Yjjj  “  Y)»  is  equivalent  to  assuming  that  the  total  input  rate  per  node 
is  maintained  constant.  Let  g  denote  the  total  input  rate  per  node 
divided  by  the  degree  of  a  node, 

*  ^  t4.20) 

Note  that  Y  will  be  adjusted  in  order  to  maintain  g  constant.  Conse¬ 
quently  and  because  of  Eqs.  (4.10),  (4.11)  and  (4.18), 

F  ■  NRg  »  NAg  (4.21) 
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T  =  (uC/h  -  g) 


(4.22) 


In  order  to  maintain  a  constant  delay  T^,  with  the  above  traf¬ 
fic  requirement,  the  channel  capacity  must  be  such  that:  (Eq.  (4.18)). 


C  = 


(4.23) 


which  says  that  C  must  grow  like  the  network  path  length  h.  If  we  let 
Cq  be  a  constant  capacity,  then  the  appropriate  capacity  scaling  is 


C  =  hCjj  (4.24) 

Notice  that  the  ratio  of  total  capacity  to  the  total  traffic  is  equal 
to  hC-/g.  This  indicates  that  for  a  linear  growth  of  N  and  F ,  the 
capacity  required  per  message  need  only  grow  like  h  (e.g. ,  h  =  for 
torus  nets) . 

From  the  above  considerations  emerge  two  scaling  schemes: 


Primal  scaling: 


Dual  scaling: 


I  C  =  hCo 

jC^hCo 
( r  =  NAg^ 


(4.25) 


(4.26) 


The  outcome  of  the  primal  scaling  is  a  traffic  F  =  NAg  where 
g  =  pCq  -  1/Tq,  and  conversely  the  outcome  of  the  dual  scaling  is  a 
constant  delay  T  =  (yCg  -  gg)'^.  Such  outcomes  will  not  be  true  when 
dealing  with  network  models  which  take  into  account  the  updates  and/or 
the  storage  limitation. 
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4.1.3  Continuous  Modelling  of  the  Hierarchical  Routing 

The  objective  here  is  to  suinmarize  the  key  variables  governing 
the  behavior  of  hierarchical  routing  in  a  continuous  (homogeneous) 
approach. 

Recall  that  a  one  level  hierarchical  routing  (CER  or  OBR)  is 
simply  the  non-clustered  routing  (NCR)  defined  in  Chapter  3.  As  a  con¬ 
sequence  and  throughout  this  chapter  we  will  refer  to  the  three  routing 
schemes  (CER,  OBR,  NCR)  as  hierarchical  routing  schemes  with  the  under¬ 
standing  that  m  =  1  refers  to  the  NCR.  Moreover,  as  observed  from 
previous  considerations,  the  underlying  clustering  structure  is  com¬ 
pletely  defined,  knowing  the  number  of  levels  m.  More  specifically, 
the  value  of  m  determines  exactly  the  length  of  the  table  £.  =  mN^^™; 
subsequently,  we  can  use  the  remark  at  the  end  of  Section  3. 3. 3. 2  to 
discretize  m  and  to  find  the  corresponding  degree  vector  n.  Notice 
that  if  m  s  1  then  £  *  N,  i.e.,  a  full  length  table  is  utilized,  which 
corresponds  to  an  NCR.  To  summarize,  the  underlying  clustering  struc¬ 
ture  of  the  hierarchical  routing  (OBR,  CER,  NCR)  is  characterized  by 
the  single  continuous  variable  m.  For  that  reason  m  will  be  referred 
to  as  the  degree  of  clustering 

Chapter  3  showed  that  the  reduction  in  table  length  from  S,  =  N 
to  S,  =  due  to  the  hierarchical  routing,  is  accoirpanied  by  an 

increase  in  path  length  from  h  to  h^.  h^  is  characterized  by  Prop¬ 
osition  3.1,  which  subsequently  provides  us  with  the  basis  for  an 
algorithmic  confutation  of  h^,  given  a  specific  network  and  a  specific 
NHR  scheme.  Fortunately,  the  bounds  derived  in  Chapter  3  will  allow 
us  to  undertake  a  worst  and/or  best  case  analytical  performance 
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evaluation  of  the  hierarchical  routing  for  the  class  of  networks  con¬ 
sidered  here.  Recall  that  those  networks  constitute  a  subset  of  the 
family  studied  in  Chapter  3,  hence  we  will  be  able  to  use  the  explicit 
e:q>ressions  for  the  relative  bounds  E  or  F  as  derived  in  Eqs.  (3.41) 
and  (3.47). 

Notice  that,  similar  to  the  above,  the  bounds  rely  on  the 
continuous  behavior  of  m,  and  more  iijportantly,  they  are  tight  at  m  =  1 
These  last  observations  will  allow  us  to  use  the  above  continuous 
approach  in  dealing  with  the  hierarchical  and  non -hierarchical  routing 
schemes.  The  comparison  between  those  two  schemes  sinply  reduces 
the  conparison  of  a  hierarchical  routing  with  m  =  1  (NCR)  to  one  with 
m  >  1  ((XR  or  OBR) . 

In  summary,  knowing  the  degree  of  clustering  m,  the  increase 
in  path  length  is  characterized  by  the  relative  bounds  E  or  F.  In 
general  E  will  be  chosen,  unless  specified  otherwise.  Hence, 

h  <  h^  £  (1  +  E)h  (4.27) 

and  m  *  1  “>  E  =  0 

Even  with  the  above  simple  specifications  of  the  hierarchical 
adaptive  routing  scheme  (i.e.,  m,  E)  the  queueing  analysis  is  still 
far  too  complicated.  Ihis  is  true  for  any  adaptive  scheme  because  of 
its  dynamic  nature.  In  the  face  of  these  difficulties,  we  will  rely  on 
observations  made  about  the  deterministic  routing  assumption  in  Section 
4.1.1  to  justify  the  modelling  of  a  hierarchical  adaptive  routing  by  an 
"equivalent"  deterministic  routing;  hence  the  assumption  below: 


Assumption  4. 1 


a.  Ihe  performance  of  an  adaptive  hierarchical  routing  is 
the  same  as  that  of  a  fixed  routing  policy  whose  routes  satisfy 
Proposition  3.1.  That  is,  the  length  of  the  "fixed  routing"  paths  are 
equal  to  the  minimum  estimated  path  lengths  as  obtained  with  an  MIR. 

b.  Ihe  fixed  routing  specified  above  and  operating  on  the 
class  of  symmetrical  nets  considered  here,  results  in  equal  loads  on 
all  channels. 

Part  (a),  again  motivated  by  an  earlier  remark  on  deterministic 
routing,  will  become  more  accurate  when  we  include  in  the  model,  based 
on  fixed  routing,  the  line  and  storage  utilization  due  to  the  adaptive 
routing.  Moreover,  if  the  main  objective  is  to  compare  hierarchical 
with  non-hierarchical  routing,  then  this  assumption  appears  to  be  quite 
acceptable. 

Part  (b)  is  motivated  by  the  quite  symmetrical  structure  of 
the  networks  considered  here,  and  also  by  the  fact  that  the  main  objec¬ 
tive  of  an  adaptive  policy  is  to  balance  the  flows  over  all  the  channels 
in  the  net. 

Notice  that,  because  of  the  above  assumption,  the  NCR  (MIR 
with  m  3  1)  is  modelled  by  the  shortest  path  fixed  routing  which,  as 
observed  in  Section  4. 1.2.1,  leads  to  the  optimal  flow  assignment  for 
this  class  of  nets. 

Conclusion; 

As  a  result  of  the  above  considerations,  a  hierarchical  routing 
is  characterized  by  the  degree  of  clustering,  m,  and  the  path  length. 


h  ,  and  can  be  n. - ,  - - - 

c  ' 


of  the  same  lengths. 


4.2  A  Queueing  Model  with  No  Updates  and  No  Storage  Limitation 

In  this  section,  we  still  temporarily  relate  the  gains  obtained 
by  hierarchical  routing  strictly  to  the  relative  table  length  8./N.  In 
other  words,  we  consider  an  ideal  situation  where  infinite  nodal  storage 
is  available  and  where  the  line  capacity  used  up  by  the  update  infor¬ 
mation  is  still  negligible.  This  is  similar  to  the  study  in  Section  3.3 
except  that  we  now  intend  to  look  at  the  degradation  of  the  network 
performance  in  terms  of  delay  or  throughput. 

Based  on  the  previous  general  conditions  (Section  4.1)  and 
the  above  remarks,  this  situation  can  be  modelled  by  Kleinrock's  model 
(Section  4.1.1)  where  the  fixed  routing  is  as  specified  in  Assumption 
4.1. 

Ihe  delay  analysis,  for  our  class  of  symmetrical  nets,  is 
reduced  to  the  one  performed  in  Section  4. 1.2.1  except  that  h  is  to  be 
replaced  by  h^.  As  a  result  the  throughput- del ay  relation  becomes 
(Eq.  (4.18)). 


^c  *  pC/h  ~  T  /NA 
c  c 

Notice  the  change  in  notation  from  F  to  F^  and  T  to  T^.  The  subscript 

c  is  added  mainly  to  differentiate  between  clustered  and  non-clustered 

routing  for  later  conparisons.  Ihe  notation  F,  T  will  be  strictly 

reserved  for  the  NCR  and  F  ,  T  for  all  hierarchical  routing;  of  course, 

c  c 
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if  m  =  1  then  T  =  T  and  T  =  T.  Similar  to  Eq.  (4.16)  and  Eq.  (4.21) 
c  c 

we  will  also  let 

=  C4.29) 

where  t  and  g  are  now  variables, 
c  c 

Ihe  behavior  of  the  hierarchical  routing  may  now  be  studied 
assuming  either  the  primal  or  dual  scaling  schemes  (see  Section  4. 1.2. 2) 
in  order  to  specif  the  growth  of  our  class  of  nets. 

The  numerical  applications  throughout  this  chapter  will  assume 
the  coefficients  a,  b,  c,  v  as  obtained  for  torus  nets  (Eq.  3.50). 

4.2.1  Degradation  in  Throughput  at  Constant  Delay  (Primal  Scaling) 
Recall  that  the  primal  scaling  (PS)  is  characterized  by 
C  =  hCg  and  T  =  *  Tq  (Eq.  (4.25))  and  also  for  an  NCR  h^  =  h,  hence 

from  Eq.  (4.28)  the  ratio  of  throu^puts  with  and  without  clustering  is 

-  i/T„ 

T"  ■  “S  -  "^'^0 

from  the  above  expression  the  effect  of  clustering  can  be  seen  in  the 
reduction  of  the  line  capacity  by  a  fraction  h/h^. 

We  can  now  directly  apply  Proposition  3.4  to  state  a  similar 
asymptotic  result.  Namely,  as  N,  the  number  of  nodes,  goes  to  infinity 
the  throughput  at  constant  delay,  obtained  with  an  MIR  (CER,  OBR)  with 
a  fixed  m,  approaches  that  of  an  NCR  while  the  relative  table  length 
(H/N)  approaches  zero. 

As  for  the  continuous  behavior  of  T  /V  versus  i/N  we  use 

c 


Eq.  (4.27)  to  derive  the  bounds  below. 


(4.30) 


MC, 


1  ♦  E 


LB(r^/r) 


UCp  -  _ 


T«  r 


LB(  )  is  used  to  denote  the  lower  bound  on  the  variable  in  parenthesis; 
similarly,  UB(  )  will  be  used  to  denote  an  iqjper  bound. 

Let  us  examine  the  behavior  of  T^/T  by  plotting  its  bounds 
given  in  Eq.  (4.30).  Recall  that  throughout  this  chapter  Eq.  (3.52), 
giving  E  and  Jl/N,  will  be  used  in  the  numerical  examples.  Ihis  results 
in  the  curves  shown  in  Figs.  4.1  -  4.3.  The  interpretation  of  those 
polts  is  quite  similar  to  that  provided  for  Figs.  3.5  -  3.9  in  Section 
3.3.3.  We  reemphasize  the  fact  that  initial  (substantial)  table  gains 
(small^  m)  can  be  obtained  with  a  relatively  small  degradation  in 
throu^put.  Whereas  larger  table  induction  (large^  m)  may  drive  the 
lower  bound  to  zero.  The  asymptotic  property  is  nicely  illustrated  in 
the  sharp  behavior  of  the  lower  bound,  for  a  large  N,  in  Fig.  4.2. 

Also,  Fig.  4.3  shows  that  as  N  increases  the  cost  incurred  with  a  fixed 
relative  table  length  K/N  goes  to  zero. 

We  note  also  that  the  curves  in  Figs  4.1  and  4.2  meet  at  the 
point  A/N  =  1,  LB(r^/r)  =  1.  That  point  corresponds  to  m  =  1  where 
the  bound  is  tight. 


Recall  from  Qiapter  2  that  most  of  the  table  reduction  is  obtained  for 
small  values  of  m,  and  the  remaining  reduction  up  to  the  global  minimum 
table  length  is  obtained  with  quite  a  bit  larger  m  (iq)  to  m«). 
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Figur*  4.2. 


4.2.2 


Degradation  of  Delay  at  Constant  Throughput  per  Node  (Dual 
Scaling) 

We  now  intend  to  study  the  dual  behavior  by  applying  the  dual 

scaling,  whose  purpose  is  to  maintain  r/NA  constant  and  to  let  T  vary. 

c 

Let  g  =  gp  =  gQ  =  r/NA  and  C  =  hC^;  then  the  ratio  of  delays  T^/T  is 
from  Eq.  (4.28) , 


-  h 
ir  ^^o  -  * 

c 


(4.31) 


Immediately  we  obtain  the  dual  asymptotic  result  whereby  as  N  goes  to 
infinity  the  network  delay  obtained  with  an  ^flR  (m  fixed)  approaches 
that  of  an  NCR,  while  the  relative  table  length  goes  to  zero. 

As  for  the  general  behavior  of  T^/T  versus  A/N,  from  Eq.  (4.27) 


<^<ub(^) 


MC, 


0 


0 


VCr 


(4.32) 


1  +  E  *0 

The  inverse  of  the  above  bounds  are  plotted  in  Fig.  4.4,  in  order  to 
show  the  behavior  of  T/T^;  dual  properties  are  exhibited.  We  also  note 
that  in  some  regions  pCq/(1+E)  <  g^,  hence  the  corresponding  degree  of 
clustering  m  is  infeasible. 


4. 3  A  Queueing  Model  with  Updates  and  No  Storage  Limitati(m 

Our  previous  models  expressed  the  gains  obtained  from  the 
hierarchical  routing  strictly  in  terms  of  the  relative  table  length 
A/N.  They  left  out  the  further  (significant)  savings  incurred  in  line 
capacity  and  nodal  storage.  As  a  result  of  this  quite  idealistic  sit¬ 
uation  of  abundance  in  c£q}acity  and  storage,  the  NCR  showed  a  better 
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network  performance  than  the  NtlR  except  at  the  limit  of  very  large  nets 
where  they  were  quite  equivalent  even  for  substantial  table  reductions. 
This  last  (asymptotic)  property  makes  us  believe  intuitively  that  in  a 
less  idealistic  situation  (storage  and  capacity  limitations)  the  MHR 
with  the  right  degree  of  clustering  will  eventually  (in  terms  of  size) 
outperform  the  NCR.  Moreover  we  can  already  observe  that  if  no  cluster¬ 
ing  is  introduced,  the  linear  growth  of  a  full  length  table  (N  entries) 
with  N  will,  at  a  certain  point,  use  up  all  the  nodal  storage  and  the 
line  capacity  (for  the  exchange  of  updates),  hence  driving  the  network 
throughput  to  zero. 

In  this  section  we  intend  to  account  for  the  traffic  generated 
by  the  routing  updates  (in  the  evaluation  of  the  X^'s)  while  keeping 
the  infinite  storage  assunption. 

As  noticed  earlier,  the  average  delay  in  our  class  of  symmet¬ 
rical  nets  is  very  simply  related  to  the  average  delay  at  any  channel 
(T  =  ht) ;  therefore,  we  will  first  analyze  a  single  channel  and  then 
generalize  to  the  whole  net. 

4.3.1  Priority  Model  for  a  Channel 

A  simple  and  realistic  Head  £f  £ine  (HOL)  model  [KLEI  76]  is 
considered  here,  mainly  to  capture  the  effects  of  updates  on  the  average 
time  spent  by  a  data^  message  waiting  to  be  transmitted  on  a  channel. 

We  assume  that  updates  are  originated  at  regular  intervals  of  time 
(motivated  by  ARPANET),  periodic  updates  may  be  modelled  by  a  "no 

^  A  data  message  is  differentiated  from  an  update  message. 
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update"  model  (Sections  4.2,  4.4)  or  by  a  certain  distribution  governing 
their  generation  times.  The  latter  possibility  can  be  easily  included 
in  the  model  below  for  such  distribution  as  the  Poisson  distribution. 
(However,  no  work  has  been  done  in  that  direction.) 

In  summary,  our  model  for  a  channel  consists  of  a  single  queue 
operated  with  a  HOL  priority  discipline  and  the  following  traffic 
characteristics. 

i.  i^date  traffic;  Deterministic  arrival  process  of 
rate  X^.  Constant  message  length  1/p^  [kbt/messg] . 

ii.  Data  traffic:  Poisson  arrival  process  of  rate  X. 
Exponential  message  length  of  mean  l/y  [kbt/messg]. 

iii.  Queue  discipline;  HOL  preeiq>tive  resume  between  data 
and  update  traffic,  with  a  higher  priority  for  updates. 
FCFS  (first-come-first-serve)  within  each  priority. 

iv.  Channel  capacity:  C  [kbt/sec]. 

Ihe  "preenptive  resume"  assumption  in  (iii)  is  introduced  to  further 
simplify  the  analysis  of  the  model,  which  otheiwise  becomes  quite 
coiqilex. 

The  above  model  is  slightly  different  from  the  one  analyzed  by 
Kleinrock  [KLEI  76]  where  he  COTsiders  the  arrival  processes  of  all  the 
types  of  customers  (messages)  to  be  ^vemed  by  a  Poisson  distribution. 
However,  his  methodology  can  still  be  used  here  in  order  to  derive  the 
average  time  in  system  for  a  data  message. 

With  regard  to  the  update  traffic,  it  simply  sees  a  D|D|1 
system;  hence  as  long  as  X^  <  (X^  ^  y^C  means  that  more  than  the 

total  capacity  is  required  by  the  updates,  thus  the  routing  becomes 
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infeasible)  there  is  no  queueing  of  update  messages;  whereas,  an  arriv¬ 
ing  data  message  will  incur  a  delay  from  the  message  (data  or  update) 
already  in  service,  from  data  messages  already  in  the  queue  and  from 
updates  arriving  during  its  system  time  (see  Fig.  4.5) 


ARRIVING 

DATA 

MESSAGE 


1-0 


DATA  OR  UPDATE 


DATA 


Figure  4.5.  State  of  the  Queue  as  seen  by  an  Arriving  Data  Message. 


Let  us  now  evaluate  the  different  conponents  involved  in  the  delay  of  ' 

a  data  message. 

1.  Delay  due  to  messages  already  in  queue.  Let  S  denote  the 
expectation  operator,  n  be  the  number  of  data  messages  (updates  do  not  • 

join  the  queue)  that  our  arriving  data  message  finds  in  the  queue,  and 
be  the  service  required  by  the  i^^  message;  then  the  average  wait 
incurred  is  <?(r^X^).  • 

Clearly,  X^,  X^,  ...,  X^  are  identically  distributed  (expo¬ 
nential  distribution  of  mean  l/p) ;  the  question  arises  as  to  the 

distribution  of  X^^,  the  service  (entire  or  remaining)  time  of  the  data  • 

message  in  the  front  of  the  queue  (see  Fig.  4.5)  which,  as  such,  could 

have  been  preempted  several  times  in  the  past.  Fortunately,  because 

of  our  exponential  distribution  assumption  for  data  messages,  the  • 

remaining  service  time  is  also  exponentially  distributed  no  matter  how 

many  times  that  message  has  been  pijeempted.  Hence  (Z^X^^)  =  n/pC, 
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where  n  is  the  average  number  in  queue  as  seen  by  our  arriving  message. 
Since  data  arrivals  are  Poisson,  then  the  distribution  of  n  is  the  same 
as  the  distribution  of  the  number  of  messages  in  the  queue  at  any  arbi¬ 
trary  time.  Therefore,  if  t  is  the  average  time  in  system  (queueing  + 
service)  for  a  data  message,  then  because  of  the  previous  remark  and 
from  Little's  results,  n  =  X(t  -  l/uC). 

2.  Delay  due  to  the  message  in  service.  Let  Y  be  the 
residual  life  (remaining  service  time)  of  the  message  in  service.  <f(Y) 
is  the  contribution  in  delay  that  we  need  to  evaluate.  Conditioning  on 
the  type  of  message  in  service,  it  is  clear  that 


Hence, 


«?’(Y)  a  <i?[residual  life  of  data  message]  • 


+  S  [residual  life  of  update  message]  • 


*  UC  *  p  C  *  2I  C 


3.  Delay  due  to  updates  arriving  while  in  system.  A  data 
message  spends  an  average  time  t  in  the  system.  During  that  time,  on 
the  average,  A^t  iq)date  messages  arrive  and  get  serviced;  hence  the 
average  wait  time  incurred  is  A^t  l/P^C.  Notice  that  even  though 
update  arrivals  are  deterministic,  their  relative  positions  with  respect 
to  data  messages  are  conpletely  random  due  to  the  Poisson  distribution 
of  data  arrivals. 

Finally  summing  all  the  waits  incurred  by  our  arriving 


message  and  its  own  service  time,  we  arrive  at 


Hence 


2[y^c]2 

^ 
A  U 

gC  '  y  C 
u 


(4.33) 


If  we  set  =  0  in  the  above  equation  (i.e.,  if  we  neglect  updates) 
then  we  arrive  at  the  original  expression  for  t,  Eq.  (4.17).  The 
difference  between  the  two  equations  illustrates  the  effect  of  the 
iq)  dates. 


4.3.2  Network  Model 

Similar  to  the  previous  model.  Assumption  4.1  results  in  equal 
channel  loads  of  data  messages  \  (X  *  h^r/NA) .  Moreover  because  of 
the  periodic  iqidate  assumption,  all  channels  will  receive  an  equal  rate 
of  iqidates  X^. 

With  regard  to  the  delay  analysis  for  our  class  of  symmetrical 
networks,  Eqs.  (4.10)  -  (4.16)  are  still  valid  when  we  replace  h  by  h^. 
Using  a  similar  notation  as  in  Section  4.2  (t,  t  ,  h,  h  ,  T,  T  ,  F,  F  , 
g*  gg) »  we  arrive  at  the  throughput -del ay  relation  below  which  charac¬ 
terizes  the  performance  of  the  hierarchical  routing. 
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(4.34) 
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Recall  that  the  size  of  an  update  message,  l/U^*  is  fixed  and  propor¬ 
tional  to  the  length  Jl  (A  *=  of  the  routing  table.  Hence,  it  is 

of  the  form  1/u  =  el  where  e  is  the  inverse  of  the  number  of  entries 
u 

which  add  to  1  kbt.  As  an  example,  in  the  ARPANET,  an  entry  requires 
16  bits  of  storage,  hence  e  =  .016  =  1/62.5.  For  further  normalization 
with  respect  to  the  average  data  message,  l/y,  we  choose  e  such  that 


4.3.3  Performance  Evaluation  of  the  Hierarchical  Routini 


The  primal  and  dual  scaling  schemes,  Eqs.  (4.25)  -  (4.26)  need 


further  specifications  as  to  the  choice  of  the  update  rate  in  terms 
of  the  network  parameters. 


4.3.3. 1  Scaling  of  X^ 

Let  us  recall  that  the  main  purpose  of  the  routing  updates  is 
to  provide  the  routing  decision  algorithm  with  a  good  estimate  of  the 
state  of  the  network.  Since  at  each  iq>date,  exchange  of  information 
(not  necessarily  synchronized  among  all  nodes)  occurs  only  between 
neighboring  nodes,  then  the  propogation  of  a  change  occuring  in  a  cer¬ 
tain  region  of  the  net  to  another  region  might  require  a  number  of 
updates  equal  to  the  distance  separating  the  two  regions.  Consequently, 
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as  the  network  grows  and  if  we  wish  a  change,  conveyed  through  the 
exchange  of  iq>dates,  to  reach  remote  areas  within  reasonable  time,  then 
it  is  necessary  to  increase  the  update  rate  as  N  increases.  We  may  also 
argue  the  "very"  remote  areas  would  not  be  as  concerned  with  that  change 
as  the  closer  ones,  thus  the  update  rate  probably  need  not  increase 
as  fast  as  N.  A  realistic  coii;)romise  would  consist  in  the  use  of  higher 
update  rates  (as  N  increases)  but  only  to  propagate  less  and  less  infor¬ 
mation  about  a  region  as  we  move  away  from  that  region.  This  remark  is 
again  a  key  motivation  behind  the  hierarchical  routing. 

From  the  above  considerations  emerge  three  possible  specifi¬ 
cations  for  X  . 

u 


i. 

A  = 
u 

'u* 

constant 

ii. 

A  a 

hx". 

\/n 

A^  for  a  torus 

U 

U 

u 

2 

iii. 

A  = 
u 

aN"/2;,0 

n1/4 

2 

•  A®  for  a  torus 

Choice  (i)  represents  a  worst-case  condition  whereby  the 
update  rate  is  insensitive  to  the  increase  in  network  size. 

Choice  (ii)  appears  to  be  more  intuitive  since  the  iq>date 
information  needs  on  the  order  of  h  (average  path)  periods  to  percolate 
throughout  the  net. 

Choice  (iii)  is  a  compromise  between  the  two  above;  it  indi¬ 
cates  that  routing  information  need  not  percolate  as  fast  in  the  entire 

1/2 

net,  but  only  within  a  certain  area  comprising  roughly  N  nodes. 

We  consider  these  three  choices  below. 


4. 3. 3. 2  Throughput  at  Constant  Delay  (Primal  Scaling) 

With  the  primal  scaling,  C  »  hCjj  and  T  *  Tq,  the  network 
throughput  is  (Eqs.  (4.34)  and  (4.35)) 

X 


JL  r  _L  u 

NA  ■  h  ^^0  "  T.  ■  ~K~ 
c  0  c 


u 


(4.36) 


For  any  routing  to  be  feasible,  the  right  hand  side  of  the  above 
equation  must  be  positive. 


Asymptotic  Behavior 

As  the  number  of  nodes  goes  to  infinity  and  under  the  con¬ 
straint  on  X^  below  (Eq.  (4.37)),  for  a  hierarchical  routing  to  be 
feasible,  its  number  of  levels  m  must  be  greater  than  or  equal  to  a 
certain  value,  m^.  Moreover,  the  asynptotic  throughput  (Eq.  (4.38)) 
shows  that  at  the  limit,  with  any  feasible  hierarchical  routing 
(m  fixed),  the  effect  of  the  updates  on  the  channel  utilization  becomes 
negligible. 

The  condition  on  X  is  such  that  there  exists  m_  which 
u  0 


satisfies 


limit 
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=  0 


(4.37) 


This  condition  is  motivated  by  the  fact  that  the  predominant  term 
(as  N  -►  <»)  in  Eq.  (4.36)  must  go  to  zero  as  N  -►  »  for  any  hierarchical 
routing  to  be  feasible.  (Notice  that  we  replaced,  in  that  term,  I  by 
and  h  by  aw'^.) 

The  proof  of  the  above  fact  follows  very  simply: 


1 


Let  m  be  fixed  and  m  ^  111^;  then  from  Proposition  3.4, 

eAXu 

and  from  Eq.  (4.37),  limit  — =  0;  hence 

N  -+■  00  "c 


limit  h/h  = 
N  -►  00  ^ 


N  00  0 


which  terminates  the  proof  of  the  above  fact. 

Ihe  above  limiting  throughput  is  equal  to  that  obtained  in  our 

previous  model  where  no  updates  are  considered.  It  is  a  more  realistic 

result  because  now  only  a  feasible  routing  (m  ^  m^)  can  achieve  that 

performance.  This  brings  us  to  a  consideration  of  the  range  of  m^  in 

terms  of  the  specific  choice  of  X  .  Let  X  be  of  the  form  X  =  N 

u  u  u 

(0  £  X  ^  1);  then  for  the  limit  in  Eq.  (4.37)  to  be  true,  m^  must  be 
such  that 


Thus 


^  ♦  X  -  V  <  0 

"0 

0  <  —  <  V  -  X 
nio 


For  the  above  relation  to  be  possible,  x  must  be  less  than  v  with  this 
condition 


> 


2 


V  -  X 


Applying  this  result  to  our  selected  scaling  schemes  for  X^,  and  using 
the  fact  that  0  <  v  ^  1 ,  we  arrive  at 

X  =  X^  —>  X  «  0  ^  >  2/v  >  2 

u  u  0  — 

V  0 

X*aNX  — >x»v  infeasible 

u  u 

X  =  aN^^^X®  *“>  X  »  v/2  ■•>  m,,  >  4/v  >  4 
u  u  0  — 
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As  a  consequence  only  the  first  and  third  schemes  may  yield  a  feasible 
hierarchical  routing.  Moreover  a  non-hierarchical  routing  is  always 
infeasible  at  the  limit  of  very  large  networks. 

The  above  properties  are  illustrated  in  the  numerical  study 

below. 


General  Behavior 

Similar  to  Section  4.2.1,  a  lower  and  tq>per  bound  on  the 
throughput  can  be  derived  using  Eqs.  (4.27)  and  (4.36). 
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Let  US  examine  the  behavior  of  T  /NA  with  respect  to  N  and  m  by  plotting 

c 

its  bounds  normalized  by  the  maximum  throughput  (per  channel  F/NA) , 

Eq.  (4.38);  i.e.,  we  will  plot 

LB(r  /NA)  UB(r^/NA) 

“S  -  1/T(,  -  1/T„ 

The  values  selected  for  the  different  variables  are: 

ViCq  «  6  messg/sec 
Tq  ■  .5  sec 
xl  .  .07  uc„ 
e  »  1/64 

Recall  that  E  and  I  are  given  in  Eq.  (3.52). 
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Three  sets  of  curves  are  shown  in  Figs.  4.6  -  4.8  and  corre¬ 
spond  respectively  to  the  three  specifications  of  i.e.,  X^  =  X®  , 

X^  =  N^^^X^/2  ,  X^  =  N^^^X®/2.  A  few  remarks  emerge  from  the 

observations  of  those  figures. 

Remark  1 :  Optimal  degree  of  clustering 

Lower  and  upper  bound  envelopes  are  also  plotted  (by  hand)  on 
each  of  those  figures.  As  a  result,  hierarchical  routings  with  the 
appropriate  degree  of  clustering  m,  will  achieve  a  throughput  (normal 
ized)  in  the  region  comprised  between  the  lower  and  i9>per  envelopes. 

The  optimal  m  corresponding  to  a  particular  envelope  can  be 
determined  numerically  as  well  as  the  envelope  itself.  Such  an  oper¬ 
ation  can  as  easily  be  done  by  hand  for  the  graphs  presented  here. 

Given  N,  choosing  m  on  the  lower  bound  envelope  (Fig.  4.6)  guarantees 
a  minimum  throughput  equal  to  the  corresponding  point  on  the  envelope. 
Fig.  4.9  shows  the  relative  table  length  (obtained  with  those  values 
of  m)  respective  to  Fig.  4.6,  and  also  the  discretized  number  of  levels 
(as  suggested  in  Section  3. 3. 3. 2). 


Remark  2;  Feasibility  and  viability 

The  fact  that  the  lower  bound  and  iqpper  bound,  for  a  specific 
set  of  parameters,  become  closer  for  small  values  of  and  meet  at 
=  0  indicates,  first,  up  to  what  size  N  a  certain  degree  of  cluster¬ 
ing  is  feasible,  and  second,  that  the  points  F^  =  0  occur  vdien  the  line 
capacities  are  totally  utilized  by  the  updates. 

Table  4.1  below,  shows  approximate  values  of  the  points  where 


Figurt  4.9.  Tabl«  Length  Atiociated  with  the  Lower  Bound  Envelope  on  the  Throughput;\u  -  Xu 


a  non-hierarchical  routing  (m  =  1)  becomes  infeasible  and  also  shows 
the  points  beyond  which  a  2-level  hierarchical  routing  becomes  certainly 
(due  to  a  lower  bound)  more  viable.  Notice  that  the  faster  the  rate 
of  update  exchange  is,  the  smaller  those  critical  values  of  N  are. 


Points  at  which  NCR 
becomes  infeasible 
(r  =  0  ,  m  =  1) 

Points  beyond  which 
clustering  is  better 

X  =x0 
u  u 

N  =  2000 

N  200 

=  1000 

=  150 

1^ 

=  300 

=  100 

Table  4.1  Critical  Values  of  N 


Remark  3;  Asynytotic  behavior 

The  shq>e  of  the  lower  bound  envelopes  for  X^  =  X®,  N^^^X^2, 

show  an  initially  decreasing  and  then  slowly  increasing  behavior  with 
respect  to  N;  the  increase  will  eventually  bring  the  curves  closer  to 
their  asyii^itote  1.  However,  for  X^  =  N^^^X®y^2,  the  lower  bound  enve¬ 
lope  is  a  decreasing  function  of  N  which,  as  predicted  earlier,  will 
eventually  reach  zero.  Ihis  means  that  in  the  neighborhood  of  a  certain 
size  N,  the  hierarchical  routing  altogether  becomes  infeasible.  For- 

g 

tunately,  that  size  is  well  beyond  10  1 

4. 3. 3. 3  Delay  at  Constant  Throughput  per  Node  (Dual  Scaling) 
r^/NA  is  maintained  constant,  i.e.,  T^/NA  »  g^,  hence 


(4.41) 


Dual  properties  to  the  primal  scaling  hold  true  here  and  are  illustrated 
in  the  curves  shovm  in  Figs.  4.10  and  4.11. 

Notice  also  that  under  the  condition  of  Eq.  (4.37)  and  for  a 
given  m,  m  ^  then 


limit 

N  -►  00 


T 

c 


which  is  the  dual  of  Eq.  4.38,  and  is  used  to  normalize  the  values  of 
in  the  plots. 


4.4  A  Queueing  Model  with  No  Updates  and  with  Storage  Limitation 

For  large  networks,  the  earlier  infinite  nodal  storage  assun^- 
tion  becomes  less  reasonable.  The  purpose  of  this  section  is  to  develop 
and  analyze  a  Kleinrock-like  model  which  takes  into  consideration  the 
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LBIT^  (AiCo-  g^)) 


I 


Figure  4.10.  Delay  at  Constant  Throughput  per  Node;  Model  with  Updates,  Xu  -  \ 
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limitation  of  nodal  storage.  Then  based  on  that  model,  we  study  the 
behavior  of  hierarchical  routing  as  applied  to  the  class  of  symmetrical 
networks.  We  choose  not  to  account  for  the  effect  of  updates  on  the 
line  capacity  utilization,  in  order  both  to  model  situations  where 
updates  can  be  neglected  and,  mainly,  to  isolate  the  consequences  of 
finite  nodal  storage  on  the  network  performance. 

Similar  to  the  previous  section,  this  study  will  also  demon¬ 
strate  a  remarkable  efficiency  of  hierarchical  routing  in  large 
networks . 


4.4.1  A  Loss-Queueing  Model  for  Symnetrical  Networks 

4.4. 1.1  The  Model 

Again  we  consider  the  class  of  symmetrical  networks  and,  in 
addition,  a  constraint  is  i]iq)osed  on  the  number  of  buffers  reserved  for 
the  store-and- forward  function.  As  a  result  of  the  limited  storage, 
three  issues  arise:  the  validity  of  the  exponential  message  length 
distribution;  the  fate  of  the  rejected  messages;  and  the  sharing  of  the 
pool  of  S/F  buffers  among  the  outgoing  channels. 

With  respect  to  the  message  length,  it  is  clear  that  a  maximum 
size  must  be  imposed,  as  is  always  the  case  in  practical  situations. 

As  an  example,  in  the  ARPANET  the  maximum  packet  size  is  equal  to  1008 
data  bits.  The  IMP  S/F  storage  is  divided  into  buffers  (pages),  each 
of  which  can  accomodate  a  maximum  size  packet  and  cannot  be  utilized 
by  more  than  one  packet  at  a  time.  As  a  result,  one  might  feel  that 
the  assunption  of  exponentially  distributed  packets  should  be  replaced 
with  a  constant  length  packet  assumption.  However,  measurements  on  the 


ARPANET  [KLEI  74]  have  shown  that  the  average  size  of  a  data  message  is 
roughly  250  bits.  The  fact  that  the  average  message  length  is  much 
smaller  than  the  buffer  size,  and  recognizing  that  messages^  which  do 
not  fit  in  a  single  buffer  occur  with  a  very  small  probability  (and 
hence  can  be  neglected),  motivates  us  to  keep  the  exponential  message 
length  assumption.  A  better  approximation  would  be  to  assume  a  trun¬ 
cated  exponential  message  length  distribution,  but  this  makes  the 
analysis  much  more  complicated  and  no  closed  form  solution  has  been 
obtained  [CHU  69]. 

As  for  the  rejected  messages,  they  can  be  either  transmitted 
by  the  sending  node  (after  a  time-out,  as  in  the  ARPANET)  or  considered 
as  lost  (as  with  blocked  telephone  calls).  The  retransmission  mode  is 
what  actually  prevails  in  general  S/F  networks  like  the  ARPANET,  NPL, 
etc.  However,  this  mode  introduces  strong  dependencies  among  the  sto¬ 
chastic  behavior  of  neighboring  (and  even  more  distant)  nodes  [ZEIG  71] 
to  the  point  that  an  analysis  seems  out  of  reach.  As  a  result,  we  will 
restrict  our  considerations  to  a  loss  model,  in  which  case  the  depen¬ 
dencies  between  nodes  due  to  storage  limitation  are  eliminated. 

Finally  with  respect  to  the  sharing  of  the  pool  of  S/F  buffers 
between  the  outgoing  channels,  the  S/F  function  of  a  node  can  be  rep¬ 
resented  as  in  Fig.  4.12a.  There,  we  see  that  accepted  messages  are 
first  submitted  to  the  routing  policy  and  then  conceptually  join  a  pool 
of  buffers  (B  buffers) .  The  routing  decision  is  assumed  to  be  fixed 
(hence  independent  of  buffer  utilization).  As  a  consequence,  a  node 

^Recall  the  single  packet  messages  assun^tion. 


is  equivalent  to  R  single  queueing  systems  which  share  a  pool  of  B 
buffez^  under  some  scheme.  A  given  channel  can  only  serve  messages 
routed  to  that  channel;  hence  messages  can  be  considered  as  forming  R 
classes  of  customers,  and  those  pertaining  to  class  i  are  to  be  imiquely 
served  by  the  i^^  channel  of  that  node  i  =  1,  2,  R  (see  Fig. 

4.12b).  Also  similar  to  the  independence  assumption  [KLEI  64],  we  will 
assume  that  nodes  (channels  in  different  nodes)  are  stochastically 
independent  and  that  the  input  streams  at  each  node  are  governed  by 
independent  Poisson  distributions. 

As  a  result  of  the  above  considerations  and  assumptions  intro¬ 
duced  along  with  them,  the  network  can  be  considered  as  a  collection  of 
independent  nodes,  each  of  which  can  be  modelled  as  R  m|m|1  queues 
sharing  a  waiting  room  of  size  B.  The  traffic  offered  at  each  node  is 
governed  by  a  fixed  routing  decision,  and  the  probability  of  blocking 
at  a  node  is  a  function  of  the  sharing  scheme  utilized.  Several  such 
schemes  are  proposed  and  studied  in  Appendix  B.  We  list  among  them: 
conplete  sharing  (CS);  sharing  with  a  constraint  on  a  maximum  queue 
length  (SMXQ) ;  sharing  with  a  minimum  allocation  of  buffers  per  queue 
(SMA),  etc. 


'9 


4.4. 1.2  Analysis 

We  first  consider  a  few  remarks  before  we  proceed  with  the 
analysis.  Because  of  the  symmetry  of  our  class  of  networks,  we  assume 
that  the  fixed  routing  results  in  equal  offered  loads  on  the  channels. 
Ihe  offered  load  A  is  defined  as  the  input  rate  of  traffic  before 
acceptance  or  rejection  by  a  node.  Moreover,  all  nodes  are  assumed  to 
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contain  the  same  number  of  buffers  B  and  to  use  the  same  buffer  sharing 
strategy.  As  a  result,  the  probability  of  blocking  (to  be  denoted  by 
Pg)  is  the  same  at  all  nodes. 

Because  of  the  possiblity  of  loss  of  messages,  the  offered 
external  traffic  F  is  no  longer  equal  to  the  throughput  of  the  network, 
which  we  denote  by  F^  (s  for  successful  traffic).  In  what  follows,  we 
intend  to  find  F^  and  the  average  delay  T  of  the  successful  traffic. 

Throughput  versus  load 

F  is  now  referred  to  as  the  traffic  load.  Let  us  define  P 

s 

as  the  probability  that  in  a  steady  state,  a  message  transmitted  over 
the  network  reaches  its  destination  successfully.  Clearly 


P  *  F  /F 
s  s 


(4.44) 


Let  h  be  the  discrete  random  variable  representing  the  distance 
in  hops  between  any  pair  of  nodes,  as  derived  from  the  fixed  routing 
policy.  Also,  let  Pj.[h  =  k]  be  the  fraction  of  node-pairs  at  distance  k 

Ic 

and  H(z)  the  corresponding  z-transform,  i.e.,  H(z)  =  E.z  P  [h  =  k]. 

jc  r 

(See  Appendix  A,  Section  A. 4  for  more  details  and  for  the  computation 
of  H(z)  for  a  torus  net.) 

Because  of  the  uniform  traffic  assunption. 


P  =  £  P_  [message  successful/message  travels  over  k  hops]  P_[h  =  k] 

®  k>l 


Since  nodes  are  assumed  independent,  the  probability  that  a  message  is 

Iq 

not  rejected  over  k  hops  is  [1-P_]  •  Notice  that  a  generated  message 

B 

is  subject  to  rejection,  whereas  a  message  reaching  its  destination 
is  always  accepted.  Thus 
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And  from  Eq.  (A. 25) 


P  =  L  [1  -  Pol^th  =  k] 

®  k>l  B  r  ~ 


f 


Hence 


Ps  =  H(1  -  Pg) 


(4.45) 


=  H(1  -  Pg)r 


(4.46) 


Relation  between  the  load  F  and  the  total  offered  internal  traffic  A 

A  is  now  the  sum  of  the  offered  input  rates  to  all  the  network 
channels.  With  the  above  definition  of  X,  the  relation  A  =  MAX  still 
holds  true.  However  Eq.  (4.6)  (A  =  nF)  is  no  longer  true  due  to  the 
possible  loss  of  messages.  A  similar  approach,  as  used  for  the 
derivation  of  Eq.  (4.6)  in  [KLEI  64],  is  considered  here  to  derive  a 
relation  between  A  and  F 

The  contribution  of  Yg^>  the  rate  of  traffic  from  s  to  t 
(Ygt  =  Y).  to  A  is 


’'st'^  .  1  -  (1  -  Pg) 

^  ^st^^  -  V - P - - 

k=0  “  B 


St 


'st 


The  proof  goes  as  follows:  if  h^^  is  the  length  (in  hops)  of  the 

st^ 

unique  path  from  s  to  t,  then  the  contribution  of  y^^  to  the  1  hop 

,nd 


(i.e.,  node  s)  is  Yg^J  to  the  2  hop,  Y5t^(l  "  Pg)  J  “'‘I  i*'  general, 

to  the  k^^  hop  it  is  y  (1  -  P_)^~^  •  The  rest  of  the  proof  is  obvious. 

sr  B 

Recall  that  F  =  N(N  -  1)y  and  y^^  =  y,  then 


E 

s,t 


h 

(1  -  Pg) 

N(N  -  1) 
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Grouping  together  all  paths  of  length  k  and  using  Eqs.  (A. 24)  and  (A. 25) 
we  arrive  at 

A  =  ^  1  -  E  (1  - 

B  L  k>l  B  r  ~ 

1  -  H(1  -  PJ  1  - 

hence  A  =  - - - —  T  =  ^  T  (4.47) 

B  *^B 

The  above  relation  (and  Eq.  (4.46))  is  quite  general;  it  only  assumes 
that  all  nodes  are  independent  and  have  an  equal  probability  of  blocking 
Pg.  H(z)  can  be  determined  cuialytically  or  numerically  given  a  partic¬ 
ular  network  and  the  associated  fixed  routing  policy. 

Notice  that  if  Pg  =  0,  i.e. ,  infinite  nodal  storage  assumption, 
Eq.  (4.47)  becomes  undefined;  however,  the  application  of  L'Hopital's 
rule  results  in  A  =  H'(l)r.  H'(l)  is,  in  fact,  equal  to  the  average 

network  path  length;  hence  we  are  back  to  the  expression  derived  in 
[KLEI  64]  (i.e.,  A  =  nf) . 


Average  delay  of  successful  traffic 

Due  to  the  symmetry  of  our  class  of  nets,  a  non-rejected 
message  will  incur  the  same  delay  t  at  each  hop;  therefore  the  average 
network  delay  is, 

T  =  n  t 
s 

where  n^  is  the  average  path  length  of  the  successful  traffic.  Note 

that  h  is  no  longer  the  average  path  length;  this  is  intuitively  due 

to  the  fact  that  messages  which  travel  on  longer  paths  are  more  likely 

to  be  rejected.  Therefore,  we  expect  n^  to  be,  in  general,  smaller 

than  h.  The  determination  of  n  follows  similar  to  that  of  P  . 

s  s 


Let  Yj]j  be  the  rate  of  successful  traffic  from  node  j  to  node  k,  and 


the  path  length  between  the  two  nodes,  then  obviously 


Y-]k  =  (1  -  Pg) 


Also,  the  sum  of  Yjj^  is  what  we  have  defined  as  Then,  from  the 


definition  of  the  average  path  length  (Eq-  (A. 3)), 


53  53  Y!i.h, 


_  J 


jk“jk 


E  E 


f  Z  E  (1 

s  j  k 


^ik 

P„)  J  Y.,h., 
B-'  ’jk  jk 


Recall  that  Yjj^  =  Y»  and  F  =  N(N  -  1)y;  thus 


F  V  ^  ,,  Y^k  ^ik 


"s  =  f  ^  ^  0  -  V 
s  j  k 


N(N  -  1) 


Grouping  together  all  paths  of  length  k,  we  arrive  at 

n  =  |{3  Ed-  Pj’^kP^^  =  k] 

*  s  kal  ^ 

From  the  definition  of  H(z)  (Eq,  (A. 25)  and  Eqs.  (4.44) ,(4.46)) ,  we 
find 

1  -  Pr  H'(1  -  PJ 

"s  =  -  V  =  -  V  H(i  -  P- y 

S  D 

where  H'(z)  is  the  derivative  of  H(z)  with  respect  to  z. 

Note  that  if  P„  =  0,  then  n  =  h  (recall  that  H'(l)  =  h  and 
b  s 

H(l)  =  1). 

If  we  let  P_  -*•  1  and  apply  I'Hopital's  rule  to  the  above 

D 

equation  (recall  that  H(0)  =  0),  we  arrive  at  n  =1.  This  result 


indicates  that  at  the  limit  (Pg  -►  1)  only  1-hop  traffic  may  still  be 
successful . 

Probability  of  Blocking 

As  noticed  earlier,  a  node  may  be  modelled  by  R  M|m|1  queueing 
systems  with  a  shared  finite  waiting  room  of  size  B.  Each  server  is 
offered  an  input  rate  A  (A  =  A/NA,  Eq.  (4.47)),  and  has  a  service  rate 
equal  to  yC.  Among  the  sharing  schemes  studied  in  Appendix  B,  complete 
sharing  is  optimal  when  p  is  small.  Below,  we  show  that  the 

maximum  throughput  is  obtained  for  a  load,  say  F^,  which  is  such 
that  p  <  1;  moreover  in  order  to  maintain  a  good  network  performance 
some  control  should  be  introduced  in  order  to  keep  the  offered  load  at 
a  level  less  than  or  equal  to  F  .  As  a  result  it  seems  reasonable  to 
choose  a  complete  sharing  (CS)  scheme.  However  if  a  retransmission 
mode  is  used  or  if  the  channels  receive  very  unbalanced  traffic  loads 
then  a  different  scheme  may  be  necessary. 

The  analysis  of-  the  CS  scheme  in  ^pendix  B  leads  to  an 
expression  of  Pg  in  terms  of  A/yC,  Eq.  (B.33),  which,  combined  with 
Eq.  (4.47)  results  in  the  system  of  equations  below. 


15S 


Also  the  average  delay  per  channel,  t,  for  successful  traffic  is  given 
by  Eq.  (B.37)  hence 


(4.50) 


The  above  system  of  equations  (4.49a-b),  may  be  solved  numer- 
i.'.ally.  A  numerical  algorithm  will  be  presented  below.  Once  we  know  X 
and  Pg,  we  can  determine  the  performance  of  the  net  in  terms  of 
throughput  and  delay  T.  Before  we  proceed,  let  us  derive  the  limit¬ 
ing  throughput  of  the  network  obtained  with  an  infinite  traffic  load. 


Limiting  throughput 

We  claim  that 

limit  =  _i-_  H'(0)NAyC  (4.51) 

P  00 

Proof; 

From  the  determination  of  A  we  see  that  X  >  T/RN  (i.e.,  X  is 
greater  than  1/R  the  traffic  generated  at  the  node  itself),  hence 
r  -►  00  =>  X  -*■  ®  ,  Also  from  Eq.  (4.49b)  it  is  clear  that 

"b  -  ‘  -  bT  I'—  f  * 

where  0(1/X)  is  such  that  limit  X0(1/X)  =  0.  Thus,  limit  P„  =  1. 

Now  from  Eq.  (4.49a)  we  see  thjtt  Pg  =  1  •=>  X  =  T/NA.  As  a  result. 
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limit  A  -?r  =  1 

r  -»■  00 

Also  from  Eq.  (4.52) 

limit  ^  (1  -  P,)  =  B"  R  ■  1- 

I  00 

Now  from  Eq.  (4.46) 

Tg  =  rH(i  -  Pg)  =  (1  -  Pg)rH'(o)  +  rod  -  Pg) 

Then,  combining  the  above  facts,  we  arrive  at  Eq.  (4.51). 

This  limiting  result  has  the  following  simple  interpretation. 
First,  note  that  H'(0)  =  P^[h  =  1]  is  the  fraction  of  pairs  of  nodes 
at  distance  one  (i.e.,  neighboring  nodes),  and  (ByC/(B  +  R  -  1)  is  the 
limiting  throughput  of  any  channel  of  the  R  M|m|1  system  of  queues 
(see  Eq.  (B,39)).  As  a  result,  the  limiting  throughput  represents  the 

fraction  of  successful  traffic  which  has  to  travel  over  a  single  hop. 

TTie  other  fraction  (finite)  of  initially  successful  traffic  has  to 
travel  over  at  least  another  hop;  in  trying  to  do  so,  it  will  conpete 
with  an  infinite  amount  of  traffic  generated  at  the  next  node,  and  thus 
it  will  be  rejected,  "niis  checks  with  the  previous  result:  Pg-»-  l=>ng-^  1. 

Algorithm  for  the  solution  of  Eg.  (4.49) 

Let  us  study  the  behavior  of  the  two  equations  (4.49  a  -  b) 
i.  Variation  of  X  versus  Pg,  Eq.  (4.49a).  Differentiating 
Eq.  (4.49a)  with  respect  the  Pg,  we  find 
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X  ’’b”'  H  -  PbI  -  ^  *  "  t‘  -  ’’b^ 

BPb  -  NA  p2 

From  the  definition  of  H(z),  Eq.  (A. 25) 


Let  z  =  1 


H'(z)  =  Z 

k>l  ^ 


the  numerator  of  the  2 


nd 


[h  =  k] 

t 

fraction  becomes 


Y  =  (1  -  z)H'(z)  -  1  +  H(z) 

Y  =  L  [z^  -  1  +  (1  -  z)kz’''^]  P  [h  =  k] 

k>l  ^  ^ 

Let  Xj^  be  the  quantity  in  brackets 


^  k-2 

k(k  -  Dz*"  ^(1  -  z) 

Hence  for  0  ^  z  £  1,  dXj^/dz  ^  0,  thus  Xj^  is  an  increasing  function  of 
2  and  finally  Y  is  an  increasing  function  of  z.  When  z  varies  from 
0  to  1,  i.e.,  when  Pg  varies  from  1  to  0,  Y  varies  from  H*(0)-  1  to  0. 

Since  H'(0)  =  P^[h  =  1],  then  H' (0)  -  1  <  0  and  Y  <  0  ¥  z  e  [0.1]. 
Therefore,  X  is  a  decreasing  function  of  P_  (see  Fig.  4.13)  and  from 

D 

previous  remarks:  Pq  =  1  =>  X  =  F/NA;  P^  =  0  =>  X  =  hF/NA. 

D  B 

ii.  Variation  of  Pg  with  respect  to  X  ,  Eq.  (4.49b) 

It  can  be  easily  shown  that  the  derivative  of  Pg  with  respect 
to  X  is  always  positive.  Tlius  Pg  is  an  increasing  function  of  X.  Also 
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X  =  0  =>  Pg  =  0  ,  dPg/dX  =  0 
X  ^  00  =>  1  ^  dPg/dX  ■>  0 


X  VERSUS  Pg  Pg  VERSUS  X 


I 


rVNA 


h  r/NA 


Figure  4.13.  Soiution  of  Eq.  (4.49). 


iii.  Algorithm, 


Fig.  (4.13)  shows  that  there  is  a  unique  solution  to  Eq. 
(4.49).  The  numerical  evaluation  is  straightforward,  and  can  use  any 
converging  iteration  procedure.  Namely,  let  us  consider  the  sequences 
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{Xj^}  and  {Pg}  such  that, 
k  k 

"=•  ’’b 

Aj  .  f(pj)  -> 

'  k-1  ‘k 

A'  =  f(Pj  ')  =>  P^ 

k 

where  the  Aj^'s  are  computed  from  Eq.  (4.49a)  and  the  Pg's  from  Eq. 

(4.49b).  Obviously,  X  =  limit  X.  is  the  solution  of  the  system  of 

k  -><» 

equations.  In  practice,  we  stop  the  search  when  I  Aj^  ■  Aj^  ^  |  ^  6  ; 

where  6  is  a  certain  tolerance. 

We  are  now  ready  to  consider  a  numerical  example. 


■  IT' 


•'(T' 


4.4. 1.3  Application  of  the  Loss  Model  to  Torus  Networks 

Let  us  consider  a  torus  (i.e.,  R  =  4)  operated  with  a  fixed 
shortest  path  routing  whose  z- transform  is  given  by  Eq.  (A. 41)  (where 
P  =  n/n).  Of  interest  is  the  study  of  the  behavior  of  the  successful 
traffic  with  respect  to  the  load  F,  as  well  as  the  behavior  of  the 
delay  T  with  respect  to  F  or  F^. 

Numerical  results  are  shown  in  Figs.  4.14  -  4.17.  Those 

-4 

results  were  obtained  for  N  =  121,  6  =  10  ,  and  yC  =  20  msg/sec. 

More  precisely,  the  graphs  show  the  normalized  traffic  and 
delay.  The  normalization  is  based  on  Eq.  (4.19)  which  defines  the 
utilization  of  the  net,  and  on  Eq.  (4.18).  Thus  the  curves  show  the 
variables 
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Jl  X  ^  A 

^  ^  yC  NA  ’  "  yC  NA  ’  h 

Figs.  4.14  and  4.15  show  (linear  and  semi-log  abscissa)  that  as  F 

increases,  F  increases  to  a  maximum  value  and  then  decreases  to  its 
s 

limiting  value  in  Eq.  (4.51).  These  results  are  similar  to  that  of 
a  contention  system  [TOBA  74],  [LAM  74],  except  that  the  non¬ 
retransmission  (loss)  of  rejected  messages  eliminates  the  possibility 
of  unstable  states. 

Note  that  if  B  =  <»  then  F^  is  equal  to  F  for  p  varying  from 
0  to  1.  For  p  ^  1  a  steady  state  solution  does  not  exist.  This  is  no 
longer  true  for  a  finite  buffer  size.  However,  with  limited  storage, 
as  p  increases  beyond  1,  the  throughput  decreases  quite  a  bit  and  thus 
more  messages  are  being  lost.  Another  effect  of  finite  storage  is 
reflected  in  the  behavior  of  the  average  delay  T  which  asymptotically 
reaches  a  constant  value  as  p  goes  to  infinity  (see  Fig.  4.16).  Since 
for  p  -►  00  (i.e.,  F  -►  oo)  only  1-hop  traffic  may  be  successful  (see 

Eq.  (4.51)),  then  the  asymptotic  value  of  T  corresponds  to  the  delay 
on  one  hop  (i.e.,  at  one  node)  under  the  condition  of  an  infinite  input 
rate.  From  Eq.  (B,39),  that  value  for  one  node  is:  limit  yCt  = 

P  -»■  00 

B/ (B  +  R  -  1);  thus  limit  yCT/h  =  B/[(B  +  R  -  l)h]. 

p  -*■  00 

Fig.  4.17  illustrates  the  behavior  of  T  versus  F  .  Note  that 

s 

to  a  given  F^  correspond  2  points  on  the  curve.  This  is  due  to  the 
fact  that  there  are  two  possible  values  of  F  which  lead  to  a  specific 
value  of  F^  (see  Fig.  4.14). 

With  regard  to  the  number  of  buffers  B,  the  plots  show  initial 
substantial  improvements  in  maximum  throughput  as  B  increases,  and 
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Figure  4.14.  Normalizad  Throughput  p^,  versus  Normalized  Load  p,  for  a  121-Node  Torus  with 
Storage  Limitation. 


B=  12 
B=  16 
B»20 


costlier  ones  when  we  reach  values  of  approximately  between  0.8  and 
0.9.  This  phenomenon  is  better  shown  in  Fig.  4.18  where  the  probability 
of  loss,  1  -  P^,  is  plotted  versus  the  normalized  load  for  different 
values  of  B.  Finally,  Fig.  4.19  shows  the  behavior  of  n^  normalized  by 
h,  with  respect  to  1  -  P^. 

In  summary,  we  are  now  able  to  evaluate  the  iirportance  of  buf¬ 
fer  storage,  and  as  expected,  small  values  of  B  can  degrade  the  network 
performance  quite  a  bit.  For  the  example  above,  B  =  20  reduces  the 
performance  to  roughly  0.68  that  which  could  otherwise  be  obtained  with 
an  infinite  number  of  buffers. 


4.4.2  Performance  Evaluation  of  Hierarchical  Routin 


The  above  model  and  Assumption  4.1  will  now  allow  us  to  study 
the  behavior  of  hierarchical  routing  for  the  class  of  symmetrical  nets. 
The  scaling  schemes  need  further  specification  as  to  the  scaling  of  the 


nodal  storage  (B) . 


4.4. 2.1  A  Buffer  Scaling  Scheme 


As  seen  in  the  conclusions  above,  the  effect  of  B  is  naturally 
characterized  by  the  maximum  throughput  F^,  or  using  our  normalized 
notations,  it  is  characterized  by  the  maximum  of  =  hr^/yOJA.  Under 
the  conditions  of  Section  4.1.2,  the  primal  scaling  scheme  maintained 
p  constant  as  N  varied.  It  is  now  natural  to  attempt  to  keep  maximum 

Pg  constant.  With  this  objective  in  mind,  let  us  observe  the  effect  of 

storage  limitation  first,  in  a  single  node  situation,  then  in  a  network 

environment.  From  Eq.  (4.49b)  we  see  that  Pg  depends  only  on  X/yC  and 

B.  Thus  a  scaling  which  maintains  X/yC  and  B  constant  will  result  in 
a  constant  Pg.  Recall  that  under  the  conditions  of  Section  4.1.2, 
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X  =  hr/NA;  hence  X/yC  is  just  our  previous  p  that  the  scaling  maintained 
constant. 

In  a  network  environment  keeping  constant  will  still  result 

O 

in  a  smaller  probability  of  success  as  the  network  grows.  This  comes 
about  because  of  the  increase  in  network  path  length  (aN^)  which  results 
from  a  larger  N.  It  is  then  necessary  to  increase  B  with  N,  in  order 
to  maintain  p  constant. 

An  ad-hoc  (heuristic)  scaling  scheme  of  B  has  been  devised 
which,  as  we  will  see,  satisfies  our  needs  to  a  large  extent.  Such  a 
scheme  is 


(b  -  B(,  in 


(4.53) 


This  scheme  has  been  tested  on  the  torus  nets  as  shown  in  Fig^  4.20 
where  p^  is  plotted  versus  p  for  a  set  of  values  of  N  and  two 
values  of  B^.  We  notice  that  the  maxima  of  p^  occur  for  roughly  the 
same  value  of  p.  The  curves  (except  for  N  =  25)  are  also  fairly  close 
to  each  other  up  to  and  somewhat  beyond  the  maxima.  Since,  in  fact, 
networks  must  be  operated  in  that  range  and  since  we  are  dealing  with 
large  networks,  this  scaling  scheme  appears  to  be  fairly  satisfactory. 
The  aforementioned  figures  showed  the  overall  behavior  of  P 

s 

versus  p  for  a  set  of  values  of  N.  If  we  restrict  our  ovservations  to 
the  maximum  throughput  p^,  we  obtain  the  curves  shown  in  Fig.  4.21. 
Ihose  curves  were  obtained  using  the  cojnbined  scaling  scheme 


j  C  =  hC^ 

(  B  =  [Bq  In  h] 


(4.54) 
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WITH  T  <  1 


(iC  = 
B  = 
Bo  = 
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102  10^  10® 

N 

Figure  4.22.  Buffer  Seeling:  ProtMbility  of  Success. 
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and  for  a  maximum  detorpiined  with  or  without  a  constraint  on  the 
average  delay  T.  The  maximum  is  computed  numerically  using  a 
Fibonacci  search  [ZANG  69].  The  probability  of  success  the 
normalized  delay  pCT/h,  and  the  normalized  average  path  length  n^/h 
at  those  maxima  of  p^,  are  shown  respectively  in  Figs.  4.22  and  4.23. 

A  few  remarks  emerge  from  the  observations  of  those  graphs: 

i.  The  curves  of  max  with  no  constraint  on  T,  and  that 
of  the  corresponding  T,  show  again  that  our  scaling  scheme  is  quite 
acceptable,  especially  when  N  >  1000.  For  that  range  of  N,  the  maxima 
of  Pg  are  between  0.7  and  0.8. 

ii.  The  fact  that  max  P  with  the  constraint  T  <  1  becomes 

s  — 

constant  (-  .74)  for  N  roughly  greater  than  10^  can  be  explained  as 
follows.  Figs,  4.22  and  4.23  show  that  the  corresponding  and  ^s/h 
are  equal  to  one  for  the  same  range  of  N.  This  means  that  at  such  a 
load,  B  is  large  enough  to  give  a  negligible  loss;  hence  it  is  safe  to 
use  the  earlier  model  based  on  the  infinite  nodal  storage  assumption 

see  Section  4.1.2).  That  model  predicts,  with  C  =  hC^  and  T  =  T^  =  1, 

“1  hr 

a  value  of  F/NA  =  -  T^"  .  Thus  we  expect  max  p^  = 

^^*^0  ”  T^)  ”  checks  with  the  value  found  in  the  plots. 

A  final  question  arises  as  to  the  sensitivity  of  the  above 
results  to  the  scaling  of  C.  For  that  purpose,  let  us  prove  the  fol¬ 
lowing  proposition. 

Consider  a  symmetrical  net  of  size  N,  which  is  such  that  all 
channels  are  of  capacity  Cj  or  network  loads  proportional  to 

the  capacity  assigned,  result,  for  a  fixed  B,  in  throughputs  propor¬ 
tional  to  the  capacities,  i.e.. 
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(4.55) 


=> 


=  yc^T^ 


where  F^,  F^  and  are  the  load,  throughput  and  delay  with  assignment 
C.  (i  =  1,  2) 


Proof 

Let  Aj  and  be  the  unique  solution  of  Eq.  (4.49)  for 

F  =  Fj  and  C  =  C^.  It  is  obvious  that  X2  =  AjyC2/viCj  and  Pg^^  =  Pg^^ 

is  the  unique  solution  of  Eq.  (4.49)  for  ^  =  F2  and  C  =  C2.  This  fact 

combined  with  Eqs.  (4.46)  and  (4.50)  proves  the  above  proposition. 

As  a  result,  the  normalized  notation  (p,  p^)  is  not  sensitive 

to  the  particular  capacity  assignment  for  a  given  N  and  B,  i.e.. 


P 


1 


=*> 


The  above  property  implies  that  the  same  curves  as  in  Fig.  4.20 
can  be  obtained  with  any  capacity  assignment  for  a  given  N  and  B. 
Moreover,  the  maxima  of  are  also  insensitive  to  the  particular  choice 
of  capacity  (recall  that  all  channel  capacities  are  equal)  at  a  given 
N  and  G.  This  in^lies  that  all  the  points  in  the  curves  in  Figs.  4.21  - 
4.23  depend  only  on  N  and  the  buffer  scaling. 

With  the  capacity  scaling  C  =  hC^  ,  becomes 


“  mC, 


F 

s 

NA 


(4.56) 


which  indicates  that  if  the  scaling  of  B  keeps  p^  constant,  then  the 
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scaling  of  C  will  keep  the  throughput  per  node  constant,  and  that  ful¬ 
fills  our  objective. 

4. 4. 2. 2  Behavior  of  the  Hierarchical  Routing 

Recall  that  Assumption  4.1  leads  us  to  consider  a  fixed  routing 
to  model  an  adaptive  hierarchical  routing.  Provided  we  know  the  distri¬ 
bution  of  the  path  length  of  the  equivalent  routing,  we  can  use  the  loss 
model  and  scaling  scheme  to  predict  the  behavior  of  such  routing  schemes 
as  m  and  N  vary.  Once  again,  we  use  the  bounds  of  Qiapter  3  to  charac¬ 
terize  the  message  path  lengths  which  result  from  a  hierarchical  routing 


Distribution  of  Path  Length 

Since  the  distribution  of  path  length  deals  with  paths  on  a 
node-pair  basis,  we  can  no  longer  use  the  bound  E,  which  was  only  valid 
for  the  average  distance.  Fortunately  Lemma  3.2  (see  Eq.  (3.29))  pro¬ 
vides  us  with  the  more  general  bound  on  individual  paths  (to  be  denoted 
by  A)  m-1 

1  i  =  E  ^  =  s 

However  this  bound,  always  true  for  CER,  is  only  valid  with  OBR  if  s,t 
belong  to  a  lower  level  cluster  than  (see  Eq.  (3.31)).  The  fact  that 
first,  A  is  a  very  generous  bound,  and  second  that  we  expect  OBR  to 
behave  better  than  CER  makes  us  feel  confident  to  use  A  as  an  approxi¬ 
mation  on  paths  for  the  OBR  scheme.  Note  that  a  rigorous  but  extremely 

m 

generous  bound,  may  be  obtained  from  Lenrnia  3.1  (h^^  £  ^  dj^) . 

k — 1 

From  Eq.  (3.46)  and  for  our  class  of  networks,  A  is  simply  equal  to  hF 
(see  Eqs.  (3.46)  and  (3.47));  and  for  an  optimal  clustering  of  degree  m 

nV  .  „v/m 

A  =  b  - +  c(m  -  1)  (4.57) 

N  '  -  1 
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Note  again  that  m  =  1  =>  A  =  0. 


A  "best  case"  and  a  "worst  case"  distribution  can  now  be 

c  c 

defined  if  we  assume  respectively  that  h  =  h  and  h  =  h  +  A. 

dir  dt  dir  dir 

To  the  "best  case"  distribution  corresponds  the  z-transform 
H(z)  of  the  shortest  paths  in  the  network. 

The  z-transform  H  (z)  of  the  worst  case  distribution  can  be 
expressed  in  terms  of  H(z)  and  A,  as  follows 

H^(z)  =  z^H(z)  (4.58) 

To  prove  the  above  equation,  let  h^  be  the  deterministic  random  variable 
representing  the  path  length,  then 

Pj.[hc  =  *^1  =  Pj.[h  =  k-A]  V-k^A 

Thus 

HJz)  =  L  =  kl  =  z^  L  z^P  [h  =  i] 

k>A  ^  i  >0  ^  '' 

which  proves  Eq.  (4.58). 

The  above  remark  that  m  =  1  =>  A  =  0  implies  also  that 

m  =  1  =>  H^(z)  =  H(z).  This  fortunate  property  preserves  the  conti¬ 
nuity  of  hierarchical  routings. 

Buffer  Assignment  and  Feasibility 

Recall  that  in  this  study  we  intend  to  account  for  the  storage 
utilized  by  the  routing  tables.  The  size  of  such  a  storage  is  a  linear 
function  of  the  table  length  and  counted  in  number  of  buffers  it  is 
equal  to  _ 

W\ 


where  1/62  is  the  number  of  entries  which  fit  in  one  buffer.  As  a 
consequence,  if  the  total  number  of  buffers  to  be  shared  between  the 
routing  table  and  the  S/F  function  is  as  defined  in  Eq.  (4.53),  then 
the  nuiid>er  of  buffers  strictly  reserved  for  the  S/F  function  is 

B  =  [Bq  in  h]  -  (■£24'!  (4.59) 

With  an  optimal  clustering  of  degree  m,  and  for  our  class  of  symmetrical 

y 

nets  (h  =  aN  ) ,  the  above  equation  becomes 

B  =  |'BQ(ln  a  +  vlnN)]  -  [e2niN^^'®]  (4.60) 

For  a  hierarchical  routing  to  be  feasible,  B  must  be  greater  than  or 
equal  to  one.  Observing  Eq.  (4.60)  we  may  conclude  that: 

i.  For  a  fixed  m,  the  routing  becomes  infeasible  for  networks 

of  size  larger  than  a  critical  nuirt)er  N  .  N  is  the  solution  of  B  =  0 

c  c 

in  Eq.  (4.60)  and  it  is,  obviously,  an  increasing  function  of  m. 

ii.  For  very  large  networks,  and  under  the  condition 

BqV  ^  £26,  only  a  hierarchical  routing  operating  with  a  global  minimum 
table  length  is  feasible,  i.e.,  ra  =  m*  =  In  N;  hence  it  =  mN^^"*  =  e  In  N. 

In  summary,  as  N  gets  larger,  it  becomes  imperative  to  move 
toward  more  clustering,  eventually  reaching  a  globally  minimum  table 
length.  The  decision  to  use  a  higher  degree  of  clustering  m  should  be 
weighed  against  the  degradation  incurred  by  the  corresponding  increase 
in  network  path  length.  This  phenomenon  is  illustrated  in  the  next 
section. 

Numerical  Application 

From  the  above  considerations,  the  application  of  the  loss 
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model  to  a  network  operated  with  a  hierarchical  routing  results  in  the 
evaluation  of  the  worst  and  best  case  performances. 

The  worst  case  performance  is  characterized  by 


r,  =  H^d  -  Pjir 

The  probability  of  blocking  Pg  is  the  solution  of  Eq.  (4.49)  where  H 

is  replaced  by  H  .  We  note  that  because  H  (z)  is  similar  to  H(z)  the 
c  c 

iterative  algorithm  devised  to  solve  Eq.  (4.49)  is  still  valid,  except 
that  the  initial  value  of  the  sequence  {Xj^}  is  now 

1  -  H  (1  -  Pg)  r  r 

=  limit  - 5 - H'(l)  ~ 

1  p  0  Pg  NA  c'  NA 

B 

with  H^(l)  =  H'(l)  +  A  =  h  +  A.  With  respect  to  the  delay  T,  let  n^ 
be  the  average  path  length  of  the  successful  traffic,  replacing  H.(z)  by 
H^(z)  in  Eq.  (4.48),  we  arrive  at  =  n^  +  A;  therefore,  T  =  (n^  +  A)t 
instead  of  Eq.  (4.50). 

The  "best  case"  performance  is  obtained  by  setting  A  =  0. 

Again  we  use  the  set  of  values  of  a,  b,  c,  v  as  derived  for 
the  torus  nets,  Eq.  (3.50),  and  the  z-transform  H(z)  as  given  in  Eq. 

(A. 31).  The  scaling  is  specified  by  the  usual  C  =  hCp  and  Eq.  (4.59) 
Also  02  is  chosen  to  be  equal  to  1/64  (this  value  is  motivated  by  the 
ARPANET  where  a  buffer  holds  up  to  64  words.) 

We  first  evaluate  the  behavior  of  an  fflR  as  applied  to  a 
specific  network  size  N  =  1681.  Fig.  4.24  shows  the  plots  of  the  lower 
bound  on  p^,  the  normalized  thoughput,  versus  p,  the  normalized  load, 
and  this  for  a  set  of  values  of  m.  In  that  example,  Bg  is  chosen 
equal  to  9;  with  that  value  the  number  of  S/F  buffers  at  m  «  1 
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(no  clustering)  is  equal  to  one.  We  observe  that  the  best  performance 

(lower  bound)  is  obtained  for  m  =  1.25  from  among  the  set  of  m's 

selected  here.  With  m  =  1.25  the  maximum  improves  dramatically  from 

p  =0.02  with  m  =  1,  to  a  minimum  of  0.58.  We  also  notice  that  the 
s 

performance  improves  as  m  varies  from  1  to  1.25  and  then  it  deteriorates 
for  m  >  1.25.  Moreover,  the  curves  with  a  smaller  m  exhibit  a  slower 
decay.  This  fact  may  be  explained  as  follows:  from  Eq.  (4.51),  the 
limiting  throughput  is 

B-rh-,  H^COJNAuC 

but 

H^(z)  =  Az^'^H(z)  +  Z^'(z) 

hence  z  =  0  =>  H^(0)  =  0  (for  A  >  1,  which  is  usually  the  case,  see 
Eq.  (4.57)),  and  also  the  larger  A  is,  the  faster  is  the  convergence  of 
H’(0)  to  zero.  Furthermore,  for  a  fixed  N,  A  is  an  increasing  function 
of  m. 

If  we  limit  our  considerations  to  an  operational  range  of  p 
(i.e.,  p  is  only  allowed  to  vaiy  from  0  up  to  a  value  slightly  larger 
than  the  one  producing  the  maximum  p.^,  roughly  0  £  p  _<  1) ,  then  the 
value  of  m  which  leads  to  the  maximum  p^  also  leads  to  the  best  per¬ 
formance  over  that  entire  range. 

As  a  consequence  of  the  above  considerations,  from  now  on  we 
will  restrict  our  observations  to  the  behavior  of  the  maximum  p 

s 

(lower  or  upper  bound)  with  respect  to  N  and  for  a  set  of  values  of  m. 
Furthermore  the  maximum  p  will  be  determined  with  or  without  a  con- 
straint  on  the  average  delay  T. 
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Fig.  4.25  shows  such  a  behavior  of  max  p^,  with  no  constraint 
on  T.  Figs.  4.26  -  4.29  show  respectively  the  corresponding  average 
delay  T,  the  probability  of  success  P^,  the  number  cf  S/F  buffers  B 
and  finally  the  corresponding  relative  bound  on  the  increase  in  path 
length  A/h. 

Figs.  4.30  -  4.32  show  similar  plots  for  max  with  a  con¬ 
straint  on  the  maximum  delay  for  T  and  P^.  A  few  remarks  emerge  from 
the  observation  of  those  figures.  Those  remarks  are,  in  general, 
quite  similar  to  the  ones  stated  at  the  end  of  Section  4. 3. 2. 2,  namely 
with  regard  to  the  optimal  degree  of  clustering  m  for  a  given  N, 
and  the  feasibility  and  viability  of  hierarchical  routing.  Before  we 
proceed,  let  us  notice  that  the  non-smoothness  of  the  curves  is  due  to 
the  discrete  changes  of  B  (see  Eq.  (4.59)).  This  fact  is  more  accen¬ 
tuated  for  the  smaller  values  of  N  where  B  is  small,  and  consequently 
a  change  of  one  unit  is  relatively  noticeable.  Round-off  errors,  as 
well  as  errors  due  to  our  numerical  algorithms  for  finding  P_  and 
especially  max  (Fibonacci  search),  may  also  contribute  to  the  non- 
smoothness  of  the  curves. 

Hierarchical  routing  with  an  appropriate  degree  of  clustering, 
m,  guarantees  a  behavior  of  the  max  p^  (with  respect  to  N)  to  lie 
between  the  upper  and  lower  bound  envelopes.  It  is  quite  remarkable 
that  the  lower  bound  envelopes  (with  or  without  a  constraint  on  T)  remain 
relatively  flat  (around  0.6),  for  N  beyond  one  hundred.  Moreover  the 
upper  bound  envelopes  are  very  close  to  the  curves  obtained  in  Fig. 

4.21;  this  means  that  at  the  point  (N,  m)  corresponding  to  those 
envelopes,  the  storage  required  by  the  updates  is  relatively  negligible. 
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Figure  4.26.  Network  Delay  at  Maximum  Throughput  with  the  MHR. 
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m  =  1 ;  1.1 


WORST  CASE  PERFORMANCE 


Figure  4.27.  Probability  of  Success  at  Maximum  Throughput. 


max  p 


Figure  4.30.  Maximum  Throughput  with  a  Delay  Constraint,  Performance  of  the  MHR; 
Model  with  No  Updates  and  With  Storage  Limitation. 


188 


(This  fact  is  also  illustrated  in  Fig.  4.28).  As  a  consequence,  the 
gap  existing  between  the  lower  and  upper  bound  envelopes  is  mainly 
caused  by  the  increase  in  the  path  length  A. 

Finally,  let  us  notice  that  the  performance  of  a  non- 
hierarchical  routing  (m  =  1)  deteriorates  very  sharply  for  values  of 
N  around  1000,  and  that  for  N  greater  than  roughly  250,  hierarchical 
routing  clearly  becomes  superior. 

4.5  A  Queueing  Model  with  Updates  and  Storage  Limitation 

In  this  section,  we  intend  to  put  our  previous  efforts  to¬ 
gether  in  order  to  devise  a  model  whereby  both  line  capacities  and 
nodal  storage  used  by  the  routing  are  accounted  for. 

The  R  M|m1i  single  node  model  with  a  finite  number  of  buffers 
B  must  now  be  modified  in  order  to  accomodate  for  the  updates.  Because  of 
the  results  in  Section  4.3.1,  a  channel  can  be  modeled  by  a  HOL  priority 
queue  (m|m|1,  d|d|1).  This  is,  however,  a  major  obstacle  in  an  analyt¬ 
ical  solution.  A  more  careful  observation  of  the  analysis  of  the  HOL 
system  (Eq.  (4.33))  shows  that  the  effect  of  the  updates  is  primarily 

to  reduce  the  line  capacity  available  for  data  traffic  from  C  to 

2 

(1  -  Py)C.  The  secondary  effect  is  the  added  term  Ay/2(vi^C)  in  the 
numerator  of  Eq.  (4.33);  it  is  this  effect  that  we  will  neglect  in  the 
sequel.  Moreover,  we  will  assume  that  the  handling  of  updates  utilizes 
some  storage  (working  storage)  other  than  the  S/F  area. 

As  a  result  of  the  above  considerations,  our  study  here  is 
reduced  to  the  one  performed  in  Section  4. 4. 2. 2  where  C  is  to  be 
replaced  by  (1  -  Py)C. 


'•M 


191 


The  results  of  a  numerical  example  are  shown  in  Fig.  4.33. 

The  same  environment  which  generated  Figs.  4.25  -  4.32  is  assumed  here, 
except  that  C  is  now 

X 

yC  =  hyCp  -  -jjJi  y 
u 

The  rate  X^  selected  corresponds  to  our  earlier  compromising  choice, 
i.e.,  X^  =  y  ,  where  X^  =  0.14  pCq.  Also  recall  Eq.  (4.35), 

/  =  A 

u 

In  addition,  y  is  set  to  one. 

Fig.  4.33  shows  only  the  worst  case  performance^  of  the 
hierarchical  routing.  The  results  are  quite  comparable  to  those  in  Figs. 
4.25  and  4.30.  The  effect  of  updates  can  be  seen  in  the  drop  of  the 
minimum  normalized  throughput  by  roughly  0.05,  except  for  very  large 
N's  where  the  drop  becomes  very  small.  In  addition,  hierarchical  rout¬ 
ing  becomes  superior  to  non-hierarchical  routing  for  N  at  around  180 
instead  of  the  previous  250. 

By  con;>aring  Figs.  4.7,  4.25  and  4.33,  we  note  that  in  this 
particular  example  the  network  is  more  storage-bound  than  capacity- 
bound. 

Finally,  the  fairly  flat  shape  of  the  lower  bound  envelope  of 
Fig.  4.33  shows  that  the  effect  of  adaptive  routing  in  terms  of  nodal 
storage  and  capacity  requirements  has  been  gradually  reduced  (as  N  grows) 
so  as  to  keep  a  smooth  network  behavior. 

^The  upper  bound  envelope  obtained  in  Fig.  4.25  is  also  valid  with  this 
application. 
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Conclusion 


In  this  chapter  we  were  able  to  demonstrate  that  for  a  class 
of  symmetrical  and  distributed  networks: 

i.  In  an  ideal  situation  of  sufficient  storage  and  line 
capacity,  the  performance  of  a  non-hierarchical  routing  is,  in  general, 
better  than  the  one  obtained  with  a  hierarchical  routing,  except  at  the 
limit  of  very  large  nets  where  they  become  quite  comparable,  even  for 
enormous  table  reductions. 

ii.  With  a  more  realistic  situation  and  with  reasonable 
assumptions  on  network  growth,  the  hierarchical  routing  becomes  not 
only  a  necessity  for  large  nets,  but  it  also  preserves  a  remarkably 
good  network  performance  for  a  phenomenal  range  of  the  network  size  N. 

The  particular  numerical  examples  studied  in  this  chapter 
showed  that  the  transition  point  where  hierarchical  routing  becomes 
surely  (because  of  lower  bounds)  better  than  a  non-hierarchical  one 
occurs  for  N  between  100  and  200. 


194 


CHAPTER  5 


CLUSTERING  ISSUES  AND  SINRJLATION  OF  THE  MHR  SCHEMES 


In  this  chapter  we  address  the  following  two  issues: 

(i)  The  assignment  of  nodes  to  clusters,  clusters  to  super¬ 
clusters,  etc.,  given  an  arbitrary  network  and  a  clustering  structure, 

^  1  a  e  a ,  n ,  m^ . 

(ii)  The  evaluation  of  the  hierarchical  routing  schemes  as 
applied  to  more  general  networks  such  as  the  ARPANET. 

With  respect  to  (i) ,  first  we  define  some  nearness  measures 
among  the  network  nodes.  Then,  based  on  those  measures,  we  choose  (and 
slightly  modify)  the  "Complete  Linkage"  (CL)  technique  as  a  solution  for 
the  clustering  problem. 

With  respect  to  (ii) ,  a  simulation  program  has  been  developed 
in  order  to  evaluate  the  performance  of  the  ^t^R  schemes  (OBR,  CER)  under 
some  fairly  general  operational  environment. 

5. 1  Clustering  Issues 

The  clustering  problem  is  generally  defined  as  one  of  finding 
natural  groupings  in  a  set  of  data,  points,  objects,  etc.  Numerous 
clustering  techniques  or  procedures  have  been  documented  in  the  litera¬ 
ture.  We  mention  in  particular  the  comprehensive  book  of  Anderberg 
[ANDE  73]  and  other  publications:  [DUDA  73],  [KERN  70]  and  the  bibliog¬ 
raphies  therein. 

In  our  context  of  computer  networks,  we  need  to  define  what 
we  mean  by  a  natural  grouping.  In  what  sense  are  we  to  say  that  certain 
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nodes  are  more  likely  to  belong  to  the  same  cluster  than  others.  Tliis 
question  actually  involves  two  separate  issues:  how  one  should  measure 
the  similarity  (nearness)  between  two  nodes,  and  how  one  should  carry 
out  and  evaluate  the  partitioning  of  a  set  of  nodes  into  clusters.  In 
this  section,  we  address  these  issues  based  on  some  observations  and 
previous  conclusions  related  to  what  we  consider  as  a  good  clustering. 

5.1.1  Characterization  of  a  "Good"  Clustering 

Let  us  temporarily  restrict  our  considerations  to  a  2-level 
clustering,  i.e.,  the  partitioning  of  the  set  of  nodes  into  disjoint 
subsets  (clusters).  Recall  that  with  a  hierarchical  routing,  routing 
tables  contain  one  entry  per  node  for  nodes  in  the  same  cluster  and 
one  entry  per  cluster.  Also,  traffic  between  nodes  in  the  same  cluster 
must  follow  paths  internal  to  that  cluster.  Furthermore,  a  given  node 
must  send  all  its  traffic  to  a  given  cluster  on  the  same  path  up  to  that 
cluster.  The  above  rules  lead  us  to  distinguish  two  cases  with  regard 
to  the  internal  and  external  properties  a  cluster  should  exhibit. 

(a)  Internal  Characteristics.  The  internal  behavior  of  a 
cluster  should  take  advantage  of  the  complete  routing  information  at 
each  node.  As  a  result,  cluster  subnets  must  include  most  of  the  "best" 
paths  between  nodes  in  that  cluster.  More  importantly,  since  internal 
traffic  is  confined  to  paths  internal  to  the  cluster,  the  cluster  subnets 
must  contain  the  shortest  paths  between  its  nodes.  This  last  property 
was  assumed  to  be  true  in  the  derivation  of  some  of  the  earlier  bounds 
on  the  increase  in  the  network  path  length  (see  Chapter  3) . 
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To  take  further  advantage  of  the  complete  routing  information. 


in  case  of  non-uniform  traffic  conditions  (i-e,,  Yjj^'s  are  not  neces¬ 
sarily  equal),  clusters  should  be  composed  of  nodes  with  higher 
intertraffic  rates. 

(b)  External  Characteristics.  Due  to  the  reduction  of 

I 

'  "external"  routing  information,  a  cluster,  as  seen  from  the  outside,  is 

equivalent  to  a  single  "super-node."  An  increase  in  the  network  path 
length  has  occured  (Chapter  3)  because  of  messages  entering  a  cluster 
from  a  non-optimal  entry  point  (exchange  node).  The  single  roost  impor¬ 
tant  variable  which  affects  this  increase  is  the  diameter  of  a  cluster, 
or  more  precisely,  it  is  the  distance  between  the  entry  points  of  a 
cluster.  As  a  result,  clusters  should  be  chosen  so  as  to  correspond  to 
highly  connected  sets  of  nodes  which  thus  lead  to  a  small  diameter  and, 
moreover,  to  a  small  average  internal  path  length.  In  case  of  low- 
connected  nets,  long  chain  clusters  must  be  avoided. 

Under  non-ioniform  traffic  conditions,  clusters  should  also  be 
selected  so  as  to  minimize  intercltister  traffic  which  is  prone  to 
utilize  longer  paths. 

The  above  considerations  can  easily  be  extended  to  an  m-level 
hierarchical  clustering.  Furthermore,  they  provide  us  with  guidelines 
as  to  the  choice  of  nearness  measures  and  clustering  techniques.  Before 
we  proceed,  let  us  notice  that  existing  nets  (e.g.,  ARPANET,  TRANSPAC) 
exhibit  the  above  features  to  a  certain  extent.  In  particular,  nodes 
tend  to  be  localized  in  highly  populated  areas,  hence  they  constitute 
natural  groupings.  This  fact  is  roc^e  likely  to  become  predominant  for 
large  networks,  and  this  is  a  further  motivation  for  the  WR  schemes. 


5.1.2 


Nearness  Measures 


In  order  to  cluster  the  set  of  network  nodes,  it  is  necessary 
to  have  some  numerical  similarity  neasurements  to  characterize  the 
relationships  among  the  nodes.  In  other  words,  a  measure  of  association 
must  be  computed  for  every  pair  of  nodes  (or  for  subsets  of  nodes  in 
case  of  very  large  nets).  Any  such  measure  must  reflect  the  above 
properties  expected  of  a  cluster,  with  the  understanding  that  strongly 
associated  nodes  are  more  likely  to  belong  to  the  same  cluster. 

The  above  properties  of  a  good  clustering  may  be  simply  sum¬ 
marized  by  stating  that  a  cluster  must  preferably  be  composed  of  a 
subset  of  nodes,  as  highly  connected  as  possible,  and  which  interchange 
as  much  traffic  as  possible. 

The  above  considerations  lead  to  the  following  intuitive  near¬ 
ness  measures.  Let  s^^  denote  the  similarity  (nearness)  between  nodes 
i  and  j.  The  most  obvious  similarity  measure  between  two  nodes  is  the 
inverse  of  the  hop  distance  between  them,  i.e.. 


s.  . 


(5.1) 


hji^j  is  as  defined  in  Chapter  3. 

In  order  to  make  the  above  measure  sensitive  to  the  traffic 
pattern,  we  modify  Eq.  (5.1)  as 


s..  =  ^ 
"J  h^^. 

ij 


(5.2) 


Where  a  is  a  positive  scaling  exponent  which  reflects  the  importance  of 


the  variable  h. .  as  compared  to  Y. .  in  the  evaluation  of  s. ..  a  =  0 
13  ij  13 

means  that  s^j^^  is  only  sensitive  to  the  traffic  rates. 


The  above  external  characteristic  of  a  cluster  motivates  us  to 

seek  the  groi;qping  of  nodes  which,  seen  from  the  outside  (i.e.,  from  other 

nodes),  exhibit  some  sort  of  unity.  For  this  purpose,  we  define  a 

matching  variable  between  arbitrary  pairs  of  nodes  i,j  as  seen  from  an 

k 

arbitrary  node  k:  m^^  .  The  function  of  this  variable  is  to,  somehow, 

reflect  to  what  extent  the  routing  table  at  node  k  indicates  the  same 

k 

route  (i.e.,  same  next -node)  for  traffic  destined  to  i  or  j.  m^^  is, 

therefore,  closely  related  to  the  routing  scheme  utilized  and  can  be 

determined  either  in  a  "static"  way  or  through  measurements. 

]( 

The  static  evaluation  of  m^^  relies  on  the  computation  of  the 

paths  which  result  from  a  given  deterministic  routing  policy  (e.g., 

shortest  path  routing,  flow  deviation  method  [GERL  73A] ,  etc.). 

k 

If  we  consider  a  shortest  path  routing,  then  m^^  is  set  equal  to  one 

if  node  k  indicates  the  same  next  node  on  the  paths  for  i  and  j;  and  it 

is  set  equal  to  zero  otherwise.  Next,  we  define  the  nearness  between 

k 

nodes  i  and  j  as  the  average  of  m^^  over  all  nodes  k, 


^ij  ~  N  -  2 


(5.3) 


As  mentioned  above,  m^^  can  be  determined  through  measurements 
such  as  the  ones  performed  on  the  AJU’ANET  [KLEI  74] .  Such  measurments 
(of  interest  to  our  study  and  others)  have  been  carried  out  by  the 
Network  Measurement  Center,  NMC,  at  UCLA.  They  consist  of  collecting 
regular  snapshots  of  the  routing  tables  over  a  certain  period  of  time. 


A  matching  value  is  then  defined  for  each  snapshot,  say  y,  as  m^j(y) 
which  is  then  equal  to  one  if  node  k's  RT  shows  the  same  next  node  for 
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the  routes  to  i  and  j  (and  zero  otherwise) . 


of  snapshots,  then  is  defined  as 


If  Y  is  the  total  number 


k 


•"ij 


(5.4) 


Finally,  s^^  is  as  defined  in  Eq.  (5.3).  These  last  two  measures  will 
be  applied  below  to  the  ARPANET. 

Further  measures  may  be  defined  as  a  combination  of  Eqs.  (5.2) 
and  (5.3),  or  else  as  based  on  other  network  characteristics.  In  this 
study  we  restrict  our  considerations  to  the  above  intuitive  and  simple 
measures  in  order  to  achieve  the  network  clustering. 


5.1.3  Clustering  Techniques 

The  nearness  measures  discussed  above  may  be  used  as  input  to 
some  clustering  techniques  which  then  realize  the  grouping  of  nodes  into 
clusters.  From  among  the  existing  clustering  techniques  [ANDE  73] 
Conplete  Linkage  (CL)  method  seems  to  be  the  most  appropriate  to  our 
study.  This  method  belongs  to  the  family  of  hierarchical  clustering 
techniques.  It  operates  on  a  similarity  matrix,  s  =  construct 

a  tree  depicting  specified  relationships  among  the  nodes.  More  pre¬ 
cisely,  with  CL,  each  node  starts  as  a  cluster.  Then,  at  each  step  the 
most  similar  pair  of  clusters,  say  p  and  q,  are  merged  into  a  new 
cluster,  called  t.  The  similarity  between  the  new  cluster  t  and  some 


other  cluster  r  is  determined  as  follows: 


As  such,  s^^  is  the  similarity  between  the  two  most  dissimilar  pairs  in 
clusters  t  and  r.  (Alternatively,  if  s^^  is  a  distance-like  measure 
(e.g.,  s.j  =  h.^)  then,  s^^  =  max  tSp^,s^^}). 

The  main  property  of  this  method  is  that  all  nodes  in  the  same 
clusters  are  linked  to  each  other  at  some  maximum  distance  or  minimum 
similarity.  As  a  result,  this  method  tries  to  identify  "maximally 
connected  subgraphs"  which  is  precisely  our  main  objective. 

This  last  property  can  also  be  used  as  a  stopping  criterion  by 
defining  a  threshold  TTi,  beyond  (or  below)  which  no  merging  is  allowed 
(i.e.,  if  s^j  <  Th  -V-  i,j  then  STOP). 

Another  stopping  criterion  may  consist  of  specifying  the  total 
number  of  clusters  (n^) . 

A  shortcoming  of  CL  is  that  it  may  lead  to  various  non-optimal 
cluster  sizes  (see  Proposition  2.1)  which  may  considerably  reduce  the 
gains  in  table  length,  i,  obtained  from  optimal  size  clusters.  There¬ 
fore  some  size  limitation  must  be  introduced.  This  can  be  realized  by 
not  allowing  the  merger  of  two  clusters  whose  combined  size  is  greater 
than  the  prespecified  limit.  This  limit  may  be  chosen  as  the  optimal 
integer  size  n^  (see  Proposition  2.6).  Furthermore,  CL  can  be  extended, 
as  shown  below,  to  accomodate  a  hierarchical  clustering  with  more  than 
2  levels,  provided  we  are  given  a  vector  of  maximum  degrees, 
n  =  ’  •  •  • »  • 

Similar  to  the  above,  let  p,q  be  the  most  similar  pair  of 

clusters,  which  are  such  that  p  is  an  i^^  level  cluster  and  q,  a 

level  cluster.  Also  let  n^(p)  (and  nj(q))  be  the  number  of  i-1^^  level 
s  t 

cluster  (j-1  )  in  p  (q) .  Three  cases  to  consider  are. 
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(a)  i  =  j :  Merge  p  and  q  into  t; 

th 

If  n^^Cp)  +  n^  (q)  £  ,  then  t  is  an  i  level  cluster  of 

degree  n^(t)  =  n^(p)  +  n^ (q) . 

St 

Else,  t  is  an  i+1  level  cluster  of  degree  =  2. 

(b)  i  <  j ;  The  i^^  level  cluster  p  is  augmented  with  j-i 
levels,  each  of  degree  one,  to  become  a  level  cluster; 
then  Step  (a)  is  performed. 

(c)  3  <  i ;  The  same  as  above  for  q. 

St 

Initially,  each  node,  say  p,  is  considered  to  be  a  1  level  cluster  of 
degree  n^Cp)  =  1  (i.e.,  it  contains  one  0^^  level  cluster).  Furthermore) 

St 

n^,  the  maximum  number  of  m-1  level  clusters,  is  usually  set  to  be 

St 

large  enough  that  the  merging  of  two  m-1  level  clusters  always  results 
St 

in  a  m-1  level  cluster  (see  Step  (a)).  Either  that,  or  the  merging 
of  the  pair  p,q  is  not  allowed. 

The  above  clustering  technique  does  not  guarantee  that  cluster 
subnets  are  connected.  This  is,  however,  to  be  expected  of  such  a 
method  (see  belowT  To  accomodate  for  this  deficiency,  a  connectivity 
check  may  be  performed  before  allowing  the  merger  of  two  clusters. 

This  check  may  be  performed  by  using  a  distance- like  measure  d^^.  which 
initially  is  set  equal  to  h^^  and  after  a  merging  of  clusters  p,q  into 
t,  then 


d._  =  rain  {d  ,  d  } 
tr  ‘  pr  qr^ 


As  a  result,  a  merging  of  p  and  q  is  allowed  only  if  I* 
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Application  to  the  ARPANET 


The  above  modified  complete  linkage  method  (without  a  connec¬ 
tivity  check)  has  been  implemented  and  applied  to  the  ARPANET.  Fig.  5.1 
shows  a  map  of  a  52  (duplex)  channel,  46  node  ARPANET  as  it  was  in 
operation  on  June  25,  1974.  CL  used  as  input  the  nearness  measures  of 
Eq.  (5.3)  as  computed,  first,  from  measurements  performed  at  UCLA  and 
second,  from  a  shortest  (hop)  path  routing. 

(i)  Clustering  based  on  measurements.  Fig.  5.2  illustrates 
the  outcome  of  CL  (as  obtained  using  the  measured  nearness)  in  terms  of 
the  relative  table  length  Ji/N,  and  the  threshold  Th.  Th  is  defined  as 
the  minimum  nearness  between  any  pair  of  nodes  which  belong  to  a  common 
cluster  at  any  level.  The  curves  are  shown  for  m  =  2,  m  =  3,  and  for 
a  variable  m  which  achieves  the  lower  envelope  for  J//N.  For  m  >  3,  the 
maximum  degrees  were  assumed  to  be:  n.  =  n»  =  . . .  =  n  ,  =  3,  as  moti- 
vated  by  Proposition  2.5;  n^  was  left  unbounded. 

Tlie  curve  corresponding  to  m  =  2  was  obtained  with  no  con¬ 
straints  on  (n^,  n^) .  It  reflects  the  fact  that  with  CL,  initially, 

^  1 

all  nodes  are  considered  to  be  clusters;  hence  Th  =  1,  j;j-  =  1  +  j^~l, 

and  at  the  last  step  of  CL  all  nodes  are  in  the  same  cluster.  Thus, 
i  1 

=  1  +  jjj-  ~  1  with  Th  =  0.11.  The  minimum  (with  m  =  2)  of  8.  is  equal 
to  17  (recall  N  =  46)  and  corresponds  to  the  clustering  structure 
shown  in  Fig.  5.3.  This  structure  is  composed  of  11  clusters,  with 
the  largest  cluster  containing  6  nodes,  TTie  suboptimum  il  =  17  is  fairly 
close  to  the  theoretic  minimum  IL  =  2fN^^^'J  =  14.  ITtis  shows  the  fairly 
good  behavior  of  CL  in  this  particular  example.  Let  us  also  note  that 
at  that  minimum  point,  the  resulting  cluster  subnets  are  connected. 
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THRESHOLD  TH 

Figure  5.2.  Minimum  Nearness  (Threshold)  in  the  "Measured"  Clustering  of  the  ARPANET. 


tS 


With  a  variable  m,  the  global  suboptimal  point  is  reached  for 


il  =  14,  whereas  the  theoretic  global  minimum  is  =  11. 

(ii)  Clustering  based  on  the  shortest  path  matching.  With  this 
measure,  CL  leads  to  the  following  suboptimal  points: 

m  =  2  =>  =  15 

m  variable  =>  Z  =13 

They  represent  a  slight  improvement  over  the  previous  ones.  For  the 
sake  of  comparison  with  the  earlier  outcome,  we  show  in  Fig,  5.4,  the 
clustering  structure  which  yields  i  =  17;  this  structure  is  also  composed 
of  11  clusters,  with  the  largest  containing  6  nodes.  The  clustering 
structures  of  Fig.  5.3  and  5.4  do  not  exhibit  strong  similarities.  This 
can  be  explained  by  the  quite  asymetrical  traffic  pattern  of  the  ARPANET 
[KLEI  74]  which  led  to  the  measured  nearness. 

The  evaluation  of  the  MdR  schemes  based  on  such  clustering 
outcomes  is  the  object  of  the  next  section. 

5.2  Simulation  of  the  WR  Schemes 

5.2,1  The  Simulator 

A  PLl  simulation  program  has  been  developed^  to  help  investi¬ 
gate  the  effects  of  hierarchical  adaptive  routing  on  the  performance  of 
arbitrary  networks.  The  computer  storage  and  computational  requirements 
for  simulating  moderate  size  (a  few  hundred  nodes)  networks  placed  a 

^A  thorough  documentation  of  this  program  is  available  at  UCLA, 

Network  Measurement  Center, 
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Figure  5.3  Sample  Outcome  of  the  Quitering  Technique  Bated  on  "Measured"  Nearnett 
Measure;  S  •  17. 
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restraint  upon  the  amount  of  detail  which  could  be  implemented  in  the 
simulator.  We  list  below  the  important  features  inplemented  in  the 
simulator. 

1.  Messages  are  assumed  to  be  single  packet  messages  (hence, 
there  will  be  no  reassembly  problems)  whose  lengths  may  be  constant  or 
drawn  from  a  truncated  exponential  distribution. 

2.  All  channels  are  assumed  to  be  duplex  channels  and  error 

free. 

3.  Each  node  (IMP)  may  be  composed  of  a  finite  or  infinite 
number  of  S/F  buffers.  In  case  of  a  finite  storage,  the  sharing  with 
maximum  queue  length  (SMXQ)  scheme  is  applied  (see  Appendix  B) . 

4.  The  routing  updates  are  periodic  and  occur  every  t  seconds. 
The  updates  are  not  synchronized  among  nodes.  On  the  contrary,  initial 
update  times  are  selected  randomly  on  a  node  pair  basis  (or  simply  on  a 
node  basis) . 

The  update  rule  is  performed  as  specified  in  Chapter  3  (for 
NCR  and  MIR  schemes)  except  that  in  Algorithm  3.9,  the  channel  length 
a^g  is  now  a  random  variable.  When  an  update  occurs,  the  channel 
length  a^^  is  estimated,  as  is  presently  done  in  the  ARPANET.  If  we  let 
kj  be  the  line  capacity  index  and  k2  the  queue  index  (k^  =  4,  k2  =  1 
for  the  ARPANET) ,  then 


ts 


=  k. 


k_q 


Z'^ts 


where  q^^  is  the  current  queue  length  (in  packets)  for  line  (t,  s) . 

A  further  modification  is  introduced  to  Algorithm  3.9  in  order 
to  avoid  chain  loops  [NAYL  75].  A  change  in  routing  at  node  t  is 
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allowed  only  if  entry  (i)  at  s  is  not  pointing  to  t.  Let  NN(s,  C^(i)) 
denote  the  next  node  field  at  entry  (i)  of  node  s  RT  (see  Fig.  3.1), 
then  the  first  step  of  Algorithm  3.9  is  modified  as 

IF  H(t.  C  (i))  >  a.^  +  HF(s.  C,(i))  AND  NN(s.  C,(i))  i  t. 

5.  Acknowledgements  (ACK)  are  assumed  to  be  instantaneous.  A 
message  forwarded  from  one  node  to  another  can  be  rejected  if  there  is 
no  space  available  (recall  that  S/F  buffers  are  managed  by  the  SMXQ 
scheme) .  This  situation  triggers  a  negative  ACK  (a  departure  from  the 
ARPANET  protocol)  upon  reception  (instantaneous)  of  that  ACK,  the 
sending  node  reschedules  the  retransmission  of  that  message.  The 
message  is  deleted  when  a  positive  ACK  is  received. 

6.  With  respect  to  the  flow  control,  messages  are  not  accepted 
in  the  net  for  either  of  the  following  reasons: 

(i)  There  is  no  buffer  space  at  the  originating  node. 

(ii)  The  total  number  of  messages  in  the  network  is  equal 
to  a  given  critical  number,  NC. 

The  rejected  messages  are  considered  to  be  lost. 

This  terminates  the  description  of  the  main  features  of  the 
simulator;  we  now  proceed  with  some  applications. 


5.2.2 


Simulation  of  the  ARPANET 


Topology:  46  node  ARPANET,  Fig.  3.1.  It  is  composed  of  52  duplex 
channels  of  capacity  50  kbit/sec  per  channel,  except  for  lines  (16,  15) 
and  (45,  15)  which  are  of  capacity  230.4  kbit/sec.  The  propagation 
delay  on  the  lines  is  set  to  zero. 

KHR:  The  net  is  operated  with  a  non-clustered  (NCR)  adaptive  routing 
and  with  a  2-level  closest  entry  routing  (CER)  scheme.  The  underlying 
clustering  structures  of  CER  are  as  shown  in  Figs.  5.3  and  5.4.  In  what 
follow',  we  will  refer  to  the  representation  of  Fig.  5.3  as  AMC,  which 
stands  for  ARPANET  Measured  Clustering,  and  ASC,  ARPANET  Static  Cluster¬ 
ing,  for  the  representation  in  Fig.  5.4.  The  non-clustered  version  will 
be  referred  to  as  ANC  (N  for  "non") . 

Traffic  Characteristics;  We  assume  a  uniform  traffic  matrix,  i.e., 

=  Y  ^  i>  j;  therefore,  the  network  throughput  is  equal  to 
N(N  -  1)y  =  2070  Y*  The  message  length  is  assumed  to  be  fixed  and 
equal  to  1  kbit. 

Update  Information;  k^  =  4,  k^  =  1.  Update  interval,  t  =  0.64  sec. 

The  initial  update  times  are  chosen  randomly  on  a  node-pair  basis.  TTie 
length  of  an  update  message  is  set  equal  to  1  kbit  with  or  without 
clustering. 

Node  Operating  Condition:  We  assume  an  infinite  nodal  storage,  and  a 
nodal  processing  time  equal  to  zero. 

The  results  of  the  simulation  are  shown  in  Fig.  5.5. 

Let  us  first  notice  that  the  maximum  throughput  is  achieved  for 
^max  *  0*274  msg/sec.  This  value  can  be  computed  by  considering  the 
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Simulation  of  a  64 -Node  Torus 


We  consider  in  this  section  a  64-node  torus  upon  which  we 
impose  two  clustering  structures  (see  Fig.  5.6).  In  this  particular 
exanple,  we  account  for  the  savings  obtained  from  the  reduction  in  table 
length;  furthermore,  we  use  the  overall  best  routing  (OBR)  scheme  to 
operate  the  network.  More  specifically,  the  operational  environment  for 
the  simulation  is  as  follows: 

Topology:  We  assume  a  64-node  torus,  composed  of  128  duplex  channels  of 
20  kbit/sec  each.  The  propagation  delay  on  the  lines  is  chosen  as  equal 
to  zero. 

NHR:  The  network  is  operated  with  an  NCR  and  an  OBR  scheme.  TTie  OBR 
scheme  is  utilized  with  two  clustering  structures:  a  2-level  clustering 
coitposed  of  16  clusters  containing  4  nodes  each  (i.e.,  n2  =  16,  n^  =  4) , 
and  a  3- level  clustering  with  degree  vector  n^  =  n2  =  n^  =  4.  These 
structures  are  shown  in  Fig.  5.6;  the  first  will  be  denoted  by  16  x  4C 
and  the  second  by  4  x  4  x  4C. 

Traffic  Characteristics:  We  assume  a  uniform  traffic  pattern.  The 
offered  load  F  is  then  equal  to  F  =  N(N  -  1)y  =  4032y.  The  message 
length  is  governed  by  a  truncated  exponential  distribution.  The  mean 
length  is  set  equal  to  500  bits  and  the  truncation  point  is  at  1000  bits 

Update  Information:  Similar  to  the  above,  we  choose  kj  =  4,  k2  =  1, 

T  =  .64  sec.  The  size  of  and  update  message  l/y^*  is  fixed  and  pro¬ 
portional  to  the  table  length;  moreover,  we  assume  each  entry  to  be 
16  bit  long.  Therefore, 
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NC  => 

Jl  =  64 

=> 

l/yu  = 

1024 

bits 

4C  => 

il  =  20 

=> 

l/yu  = 

320 

bits 

4C  => 

«,  =  12 

=> 

1/yu  - 

192 

bits 

Node  Operating  Condition;  The  nodal  processing  is  set  to  zero.  Each 
node  is  assumed  to  contain  16  S/F  buffers.  These  buffers  are  shared 
between  the  4  outgoing  channels  under  the  SMXQ  sharing  scheme,  where  12 
is  the  maximum  length  of  any  queue. 

Results :  Fig.  5.7  shows  the  results  of  the  simulation  under  the  above 
operating  conditions.  We  observe  that  OBR  with  the  "16  x  4"  2- level 

clustering  achieves  a  better  performance  than  NCR  and  OBR  with 
4  X  4  X  4C.  The  difference  with  NCR  is  not  too  considerable;  this  is 
due  to  the  small  size  of  our  network  (64  nodes)  which  thus  does  not 
yield  to  great  savings  in  table  reduction.  However,  this  indicates 
that  clustering  starts  to  pay  off,  even  for  such  a  small  network  size. 
The  OBR  scheme  with  the  4  x  4  x  4C  clustering  structure  achieves  the 
worst  performance;  furtheimore,  is  saturates  faster  than  the  two  other 
cases.  This  is  due  to  the  fact  that  such  a  degree  of  clustering  (high 
for  this  small  net)  leads  to  a  significant  increase  in  path  length, 
hence  in  internal  traffic. 

Let  us  also  note  that  for  F  =  1700,  the  network  gets  into  a 
deadlock  situation.  This,  and  the  bending  of  the  curves  at  F  =  1600, 
is  due  to  the  buffer  limitation.  A  better  flow  control  scheme  is 
required  to  operate  the  net;  recall  that  the  only  control  introduced 
here  is  in  terms  of  the  total  number  of  messages  which  can  be  in  the 
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Figure  5.7. 


network  at  any  given  time.  That  number  was  selected  to  be  equal  to 
819  (approximately  4/5  of  the  total  number  of  buffers  in  the  network) 

5.3  Conclusion 

In  this  chapter  we  mainly  laid  the  framework  and  introduced 
a  methodology  in  defining  nearness  measures  between  pairs  of  nodes. 
Those  measures  were  utilized  with  a  modified  complete  linkage  method 
to  achieve  the  clustering  of  the  ARPANET.  Applications  to  larger  net 
works  should  be  investigated,  along  with  other  clustering  techniques. 

The  simulation  of  a  64-node  torus  confirmed  our  earlier 
theoretic  results  which  indicated  that  the  MHR  schemes  with  an  appro¬ 
priate  clustering  structure  eventually  (as  the  network  grows  in  size) 
outperform  the  NCR  scheme.  The  transition  point  was  seen  here  to  be 
even  smaller,  in  the  neighborhood  of  64. 


CHAPTER  6 


TOPOLOGY  DESIGN  CONSIDERATIONS  FOR  LARGE 
COMPUTER  COMMUNICATION  NET^VORKS 

6.1  Introduction 

Up  to  now,  we  were  given  a  large  distributed  computer  communi¬ 
cation  network,  and  the  problem  was  to  devise  an  appropriate  adaptive 
routing  scheme  which  would  operate  efficiently  with  a  fairly  small 
amount  of  routing  information.  The  study  of  the  MIR  routing  schemes 
showed  that,  indeed,  they  represent  a  class  of  very  satisfactory 
solutions  to  the  routing  problem  in  large  networks.  Now,  going  one 
step  back,  the  problem  becomes  that  of  designing  the  topology  of  a 
minimum  cost  communication  network  which  connects  a  large  set  of  nodes 
and  which  satisfies  a  set  of  given  requirements  (traffic,  delay, 
reliability,  etc.).  Furthermore,  it  is  desirable  that  the  resulting 
network  be  fairly  easy  to  operate  (routing,  flow  control,  adaptation 
to  failures,  etc). 

The  bursty  nature  of  computer  traffic,  as  well  as  the  con¬ 
tinuously  decreasing  cost  of  computer  hardware  [ROBE  74],  very  much 
favor  packet  switching,  as  currently  used  in  the  ARPANET  [ROBE  70], 
[HEAR  70],  [KLEI  70],  [FRAN  70],  [CARR  70],  [MCQU  74],  as  the  tech¬ 
nology  for  us  to  consider.  Several  different  formulations  of  the 
topology  design  problem  can  be  found  in  the  literature.  Generally, 
they  correspond  to  heuristic  methods  with  different  choices  of 
performance  measures,  design  variables  and  constraints.  Along  those 
formulations,  several  solutions  have  been  proposed  [GERL  73A] 


[FRAN  72  ]  and  applied  to  ARPA-like  network  designs.  Howevei ,  for 
networks  with  more  than  a  few  hundred  nodes,  present  procedures  fail 
because  of  the  large  amount  of  computer  time  and  storage  needed  to 
perform  the  optimization.  Design  procedures,  based  on  hierarchical 
clustering,  have  been  proposed  [FRAN  73],  [GERL  73B],  [COVI  74]  to 
substantially  reduce  the  computational  cost  and  the  storage  require¬ 
ment  involved  in  the  optimization  step.  Generally  speaking,  in  a 
two-level  hierarchical  design,  the  nodes  will  be  grouped  into  clusters 
and  "gates"  (special  cluster- exchange  nodes)  selected  from  each 
cluster.  Cluster  subnets  will  be  designed  separately,  and  then  a 
supemet  of  gates  will  be  designed  to  connect  the  clusters  together. 

The  assumption  is  that  nodes  in  the  same  cluster  are  most  likely  to 
be  very  few  hops  apart  in  either  a  non-hierarchical  or  hierarchical 
design.  The  approach  described  briefly  above  could  be  easily  extended 
to  more  than  two  levels  in  the  hierarchy. 

In  this  chapter,  after  stating  the  design  problem  in  a  more 
precise  form,  a  class  of  hierarchical  design  procedures  will  be 
presented.  The  focus  will  be  upon  the  determination  of  a  certain 
optimum  hierarchical  clustering  structure  of  the  set  of  the  nodes,  to 
be  used  in  the  design  phase.  In  other  words,  we  will  be  concerned 
with  the  choice  of  the  cluster  sizes  at  any  level  and  the  number  of 
levels  which  will  minimize  a  certain  objective  function  (e.g.  computa¬ 
tional  cost).  Some  related  questions,  such  as  the  decomposition  of  the 
global  performance  variables  and  requirements,  will  also  be  addressed. 

Frank  and  others  [FRAN  73]  showed  from  a  feasibility  study  of 
a  1000  node  network  that,  indeed,  hierarchical  structures  are  desirable 
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for  the  design  of  large  networks.  They  also  posed  the  same  questions 
concerning  the  clustering  structure  but  failed  to  answer  them  for  the 
general  case  (m  levels) . 

6.2  The  Topology  Design  Problem 

The  same  network  model  as  in  Paragraph  6.1  is  used  in  this 

section. 


9 


-  9 


4 


6.2.1 


Delay  Analysis 
Recall  Eq.(4.5) 


T  = 


1  NA 

F  E 


i=l 


More  elaborate  expressions  for  T  [FULT  72],  [KLEI  74]  may  be  obtained 
to  include  overhead  traffic,  nodal  processing  delay,  etc.  For  the 
purpose  of  illustrating  the  complexity  of  the  design  problem,  we  will 
limit  our  consideration  to  the  simple  expression  above.  In  the  design 
problem,  T  will  appear  either  as  the  objective  function  to  be  minimized 
or  as  a  variable  constrained  to  be  less  than  or  equal  to  a  given  value, 

T 

max 

fk  8.1 

Let  f^  ’  be  the  average  flow  (bits/sec)  produced  in  channel  i 
by  messages  traveling  from  source  k  to  destination  8.  (flow  of  commodity 
(k,  £.)),  and  let  f^  be  the  total  average  flow  in  channel  i,  given  by: 


f. 

1 


N  N 

=  z  z 

k=l  )l=l 


r(k,)l) 
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X. 

Consequently 

1  na  f. 

and  c —  f  (6.1) 

i=l  i  i 

Let  f  =  (fj,  ....  £^j^)  be  the  flow  vector,  and  let 
C  =  (Cj^,  C^,  ...»  be  the  capacity  vector.  For  the  above  equation 

to  be  feasible,  the  vectors  f  and  C  must  satisfy 

^  -s* 

f<C  i.e.,  f.  <C.  (6.2) 

The  above  relation  will  serve  as  a  constraint  in  the  design  problem. 

6.2.2  The  Communication  Cost 

To  the  i^^  link,  we  have  assigned  a  channel  capacity  C^.  C^ 
is  a  discrete  variable.  However,  for  the  development  of  efficient 
analytical  techniques,  it  is  often  convenient  to  approximate  C^  with  a 
continuous  variable  [KLEl  64],  [GERL  73A]  during  all  (or  part)  of  the 
optimization  phase,  after  which  these  continuous  values  are  discretized. 
Typical  capacity  options  and  their  cost  are  listed  in  [GERL  74].  The 
cost  of  channel  i,  denoted  d^(Cj^),  is  a  functio*  of  the  capacity  and 
the  length  of  the  channel^.  In  general,  the  cost  shows  economies  of 
scale  with  respect  to  capacity,  and  it  increases  linearly  with  the 
length  of  the  channel. 


More  elaborate  charging  structures  such  as  ATT's  new  DOS  service 
have  also  been  devised. 
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6.2.3 


Traffic  Requirement 
Average  "traffic  requirements"  between  nodes  can  be  represented 
by  a  matrix  R 


R  ^  [Y.,] 


Recall  that  (messages/sec)  is  the  average  transmission  rate  from 


source  j  to  destination  k.  In  general,  R  is  given  as  an  input  param¬ 
eter  to  the  problem.  In  some  cases,  R  is  given  as 


R  =  pR 


where  R  is  a  known  traffic  pattern  and  p  is  a  variable  scaling  factor, 
usually  referred  to  as  the  traffic  level. 

As  a  result  of  the  traffic  requirements,  the  following  multi- 
commodity  flow  constraint  [HU  69]  must  be  satisfied. 


y'  £(k»^)  y  £(R»^) 

jei  ji  ■  jt*! 


if  i  =  k 

if  i  =  Jl 

otherwise 


(6.3) 


6.2.4  Reliability 

Links  and  nodes  in  a  real  network  can  fail  with  non-zero  prob¬ 
ability,  thus  interrupting  some  communication  paths.  It  is  inportant 
to  evaluate  the  overall  network  reliability  in  the  presence  of  such 
failures.  Several  reliability  measures  are  available:  the  probability 
of  the  network  being  disconnected,  the  fraction  of  communicating  node 
pairs,  the  connectivity  of  the  network,  etc.  [FRAN  72].  However,  no 
simple  satisfactory  measure  exists  yet.  In  the  design  problem,  for  the 
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sake  of  simplicity,  connectivity  is  usually  used  as  a  reliability 
measure. 

This  concludes  our  model  description,  and  now  we  are  ready  to 
state  the  design  problem  more  formally  and  then  to  discuss  its  com¬ 
plexity. 

6.2.5  Topology  Design  Problem 

The  topological  design  problem  can  be  defined  as  follows: 

Given:  Node  locations 

Requirement  matrix  R 
Cost-capacity  functions  d^(C^) 

Minimize:  D(A,  C)  =  51  d.(C.) 

ieA  ^  ^ 

Over:  A,  C,  f  (6.4) 

where  A  is  the  set  of  arcs  which  corresponds  to  a  specific  topology, 

C  is  a  vector  of  capacities  and  f  is  a  multicommodity  flow,  such  that 

(a)  f  satisfies  the  requirement  matrix  R  (see  Eq.  (6.3)) 
,(b)  f  <  C  (see  Eq.  (6.2)) 

«=)  f  ■  7  -  Vx 

'  leA  1  1 

(d)  Reliability  constraint:  e.g.,  K-connectivity. 


Notice  that  the  above  cost  function  associates  the  entire  network  cost 
with  the  channels  themselves  [KLEI  64],  [GERL  73A].  This  does  not 


fnt  .1  loss  of  generality  with  respect  to  the  nodal  costs  which 
■i.t.  he  grouped  with  the  channel  costs. 

In  general,  there  are  2^^^  l)/2  possible  topologies.  Consid¬ 
ering  all  possible  designs  by  a  computer  is  out  of  the  question. 
Furthermore,  capacities  are  available  in  discrete  sizes.  This  means 
an  enormous  integer  optimization  problem  must  be  solved. 

The  non-linearity  of  the  time-delay  functions  and,  in  some 
cases,  of  the  reliability  measure  add  another  dimension  of  complexity 
to  the  problem. 

There  exists  no  efficient  technique  for  the  exact  solution  of 
the  topological  design  problem.  Several  heuristic  procedures  have  been 
proposed  and  implemented.  Among  them,  we  mention  the  Branch  X-change 
method  [FRAN  72]  and  the  Concave  Branch  Elimination  method  [GERL  73B]. 
They  typically  start  with  an  initial  topology  over  which  they  perform 
some  alterations  in  the  course  of  the  optimization.  Built  into  those 
procedures  and  inherent  in  the  multicommodity  nature  of  the  flow,  is 
the  determination  of  the  shortest  path  between  any  pair  of  nodes  in  the 

3 

network.  This  operation  requires  approximately  up  to  the  order  of  N 
2  3 

operations  (N  to  N  ,  N  =  number  of  nodes)  and  may  be  performed 
many  times  in  the  course  of  the  optimization.  The  overall  conputational 
complexity  corresponding  to  those  heuristics  is  estimated  to  be  on  the 
order  of  to  [FRAN  73],  [FRAN  72]. 

For  networks  with  more  than  a  few  hundred  nodes,  present 
procedures  fail  because  of  the  large  amount  of  conputer  time  and 
storage  needed  to  perform  the  suboptimization.  Furthermore,  for  such 


networks,  incremental  changes  in  the  number  of  nodes  ought  to  be 
considered  on  a  local  basis  rather  than  on  a  global  basis  as  it  is 
performed  now.  In  other  words,  addition  or  deletion  of  a  node  should 
not  affect  the  topology  of  the  entire  network  to  a  large  degree.  As  a 
result,  new  approaches  are  needed  to  deal  with  the  design  of  large 
networks. 

6.3  The  m-level  H^ierarchical  Topology  (NUT)  Design  Procedure 

Various  tools  are  available  in  the  field  of  operations  research 
to  solve  certain  classes  of  large  scale  mathematical  progranuning 
problems.  Among  those  techniques  are  the  Dantzig-Wolfe,  Bender's, 
Lagrangian  decomposition  methods  and  the  column  generation  method 
[LASD  70],  72],  [GEOF  74],  [GRAV  72].  Due  to  the  complexity  of 

our  design  problem,  the  direct  application  of  these  techniques  does 
not  seem  too  promising  and  even  less  practical.  Consequently,  a  good 
heuristic  method,  based  on  intuition  and  on  the  natural  properties  of 
the  problem,  is  desirable.  However,  exact  design  procedures  should  be 
developed  in  order  to  serve  as  comparison  for  the  evaluation  of  any 
heuristic  design  technique.  Hierarchical  design  procedures  are  good 
candidates  to  alleviate  the  tremendous  complexity  of  the  problem  and 
yet  produce  some  good  topological  design.  Such  a  procedure,  the 
m-level  fiierarchical  ^opology  (KUT)  design  method,  will  be  presented 
and  studied  in  the  rest  of  the  chapter. 

The  idea  behind  the  MHT  is  to  impose  a  decomposable  structure 
on  the  design  problem  which  .'Ul  result  in  a  set  of  smaller  sub¬ 
problems.  In  other  words,  we  will  introduce  independencies  among 


subsets  of  design  variables.  The  imposed  independencies  will  substan¬ 
tially  reduce  the  set  of  feasible  solutions  and  also,  as  a  direct 
consequence,  the  computational  cost.  In  doing  so,  there  is  the  risk 
of  eliminating  the  optimal  solution.  Therefore,  it  is  very  important 
to  seek  "natural"  deconpositions. 

v^rhe  four  major  steps  in  an  MHT  are: 

a.  m- level  hierarchical  clustering  (MHC)  of  the  set  of  nodes. 

b.  Decomposition  of  the  objective  functions  and  the  design 
constraints . 

c.  Decomposition  of  the  optimization  problem  into  a  set  of 
smaller  subproblems.  Solve  for  the  set  of  subproblems 
with  possible  iterations. 

d.  Exact  performance  evaluation  of  the  resulting  network 

and  comparison  with  optimum  performance. 

6.3.1  Step  a;  MHC  of  the  Set  of  Nodes 

The  decomposition  mentioned  above  will  be  realized  through  an 
m  level  hierarchical  clustering  of  the  set  of  nodes,  based  on  some 
appropriate  nearness  measure.  As  defined  in  Chapter  2  (see  Fig  2. 
and  Paragraph  2.3.1  for  definitions  and  notation),  the  MHC  consists  of 
grouping  the  network  nodes  (0^^  level  clusters)  into  1®^  level  clusters 
which  in  turn  are  grouped  into  2**^  level  clusters.  This  operation 
continues  in  a  bottom  up  fashion  until  the  grouping  of  the  m-2”^^  level 
clusters  into  m-1®^  level  clusters  whose  union  constitute  the  m^^  level 
cluster.  The  m  - ivel  cluster  is  the  highest  level  cluster,  and  as 

such,  it  includes  all  the  nodes  of  the  network.  This  partitioning, 
easily  describable  by  a  tree  structure  as  in  Fig.  2.4,  could  also 


proceed  in  a  top  down  manner. 

The  above  structural  similarities,  with  the  MHC  devised  for 
the  adaptive  routing  in  large  networks,  do  not  quite  carry  over  to  the 
choice  of  the  nearness  measures  and  probably  not  to  the  clustering 
techniques,  both  of  which  will  greatly  affect  the  outcome  of  the  MHT. 
The  nearness  measures  must  take  into  account  the  cost  of  the  different 
components  of  a  communication  network  (switching  nodes,  channels, 
etc.),  the  traffic  and  reliability  requirements,  the  delay  of  a  message 
in  the  net,  etc.  Some  examples  of  simple  nearness  measures  are: 


(i) 


s.  . 
11 


nearness  between  nodes  i  and  j 


(ii) 


where 


d^j  is  the  geographical  distance  between  nodes  i  and  j; 

is  the  rate  of  traffic  from  i  to  j,  and  the  exponent  e  is 
between  zero  £ind  one. 


Here  also,  the  direct  ^plication  of  the  clustering  techniques 
[ANDE  73],  [DUDA  73]  may  lead  to  various  non-optimal  cluster  sizes 
which  will,  in  general,  considerably  reduce  the  computational  gains 
[kern  70]  obtained  from  optimal  size  clusters.  One  way  to  decide  upon 
an  MIC  structure  is  to  choose  the  one  which  will  minimize  the  computa¬ 
tional  cost  incurred  in  the  MHT.  This  problem  will  be  posed  and  solved 
later  in  this  chapter. 

Along  with  the  MHC,  we  must  select  the  gates  (exchange  nodes) 
for  all  clusters  at  all  levels  (Fig.  6.1).  The  function  of  the  gates 
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from  a  given  cluster  is  to  handle  the  traffic  exchanged  between  the  set 
of  nodes  in  that  cluster  and  those  outside.  More  specifically,  the 
assumption  underlying  the  flow  of  messages  is  as  follows. 


k.-' 


Flow  Assumption  6.1 

a.  Traffic  between  nodes  in  the  same  cluster,  at  any  level, 
will  only  take  paths  which  are  internal  to  that  cluster,  i.e.,  paths 
contained  in  the  corresponding  local  subnetwork. 

b.  Traffic  between  nodes  in  different  k^^  level  clusters 

s  t 

(k  =  1,  ...,  m  -  1),  but  which  belong  to  the  same  k+1  level  cluster, 

St 

will  first  be  channeled  to  a  k+1  level  gate  of  the  originating  clus- 

s  t 

ter  over  its  local  subnetwork;  then,  it  will  take  the  k+1  layer 

s  t 

subnetwork  of  gates  to  reach  a  k+1  level  gate  of  the  destination 
cluster,  at  which  point  it  will  be  dispatched  over  the  local  subnetwork 
to  finally  reach  the  destination  node.  (Ihis  is  the  standard  procedure 
in  hierarchical  networks.) 

Fig.  6.1  illustrates  some  of  the  preceding  and  forthcoming 

definitions  for  a  3-level  hierarchical  design. 

A  local  subnetwork  of  a  k^^  level  cluster,  say  Cj^,  is  defined 

recursively  as  the  collection  of  those  local  subnetworks  associated 

with  each  k-1^^  level  cluster  which  belongs  to  Cj^  and  which 

are  connected  together  through  a  subnetwork  of  k  level  gates  called 
th 

a  k  layer  subnetwork.  Consequently,  such  a  local  subnetwork  is 
composed  of  k  hierarchical  layers,  each  of  which  is  composed  of  a 
certain  number  of  layer  subnetworks.  As  a  reminder  of  the  k  layers, 
the  subnetwork  will  also  ©ccasionally  be  called  a  k-level  subnet  or  a 
k-level  local  subnet. 
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Figure  6.1.  A  3-Level  Hierarchical  Network. 
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The  selection  of  gates  at  all  levels  is  governed  by  the  fol¬ 


lowing  assumption. 


Gate  Assumption  6.2 

Given  an  integer  vector  B  =  (Bj,  $2>  •••»  where  Bj  =  1 
and  a  selection  rule,  then,  starting  at  k  =  1,  select  Bj^^jk+1^^  level 
gates  among  the  set  of  level  gates  of  each  level  cluster 


Cj^{i^,  ...»  ij^^j).  Repeat  this  step  sequentially  until  k 


=  m  -  1.  A 


St 


network  node  is  considered  to  be  a  1  level  gate. 

Let  e.  be  a  k^^  level  gate,  and  E,  (i  ,  i  ,,...,  i,  ,)  = 
k  *  *  k  m*  m-1’  ’  k+l-* 

f  th 

{©k  /  e^  e  ^m-l’  ^k^^  level  gates  in 

the  corresponding  cluster.  Also,  let  gk^^m’  ^m-1'  "**  ^k+P 
size  of  the  abovementioned  set  of  gates.  As  a  consequence  of  the  gate 
assumption,  the  following  relation  exists  between  the  size  and  the 
degree  vectors. 


®k^^m’  ^m-1’  •••’  ^k+P  ^k  "k^^m’  •**’  ^k+P 

Notice  that  for  k  =  1,  the  set  of  1^^  level  gates  of  a  1*^  level 
cluster  is  simply  the  set  of  all  the  network  nodes  contained  in  that 
cluster,  hence 


8l<V 


m-1’ 


^2^  =  "l^^m’ 


^m-1’ 


"2^ 


which  explains  the  convention, 
to  be  feasible,  the  number  of  k 
Ck*  must  be  no  less  than  Bk^j» 


Bj  =  1.  For  the  above  selection  scheme 
level  gates  of  a  k*^"  level  cluster, 
i.e. , 
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m-l’ 


- 


k+1 


¥  k  =  1, 


m 


The  above  relation,  combined  with  Eq.  (6.5),  will  henceforth  introduce 
a  constraint,  to  be  referred  to  as  "gate  constraint,"  over  the  choice 
of  the  degree  vector  n  characterizing  the  M4C  structure. 


Gate  Constraint 


n,  (i  ,  i  1 , 
k^  m*  m-l’ 


,)  > 


"k+1 


^k+1^  -  6, 


¥  k  =  1,  2, 


■V  (i  ,  i  , 
m  m-l 


,  m  -  1 
■’  ^k+1^ 


The  choice  of  the  vector  8  will  be  mainly  related  to  the 
reliability  constraint,  Eq.  (6.4).  If  a  K-connectivity  is  to  be  im¬ 
posed  on  the  topology  of  the  network,  then  the  vector  8  must  be  such 


that  8^  ^  K  for  i  =  2,  3,  . . . ,  m.  This  is  obvious  since  the  set  of  the 


8j^  i^^  level  gates  of  an  i-1®^  level  cluster  represents  a  cut  set 


[HARA  72]  for  the  other  nodes  in  that  cluster. 

Let  a  centralized  hierarchical  network  be  defined  as  a  hier- 


th 


archical  network  whose  topology  is  such  that  all  k  level  clusters 

s  t 

(for  all  k's)  communicate  with  the  outside  through  only  one  k+1  level 
gate.  Consequently,  the  corresponding  vector  8  is 


8  =  (1,  1,  ....  1)  i.e.,  1  ¥i  =  l,  ...,m 


This  structure  is  commonly  found  in  the  hierarchical  star  networks  and 
also  in  the  organization  of  many  business  or  government  agencies. 
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6.3.2  Deconiposition  Step 

The  main  computational  gains  introduced  by  the  MHT  come  from 
the  separate  design  of  the  different  con^jonents  of  the  networks,  i.e., 
the  design  of  the  layer  subnets  at  all  levels.  To  perform  this  opera¬ 
tion,  it  is  necessary  to  define  performance  measures  for  those  subnets 
as  well  as  the  corresponding  constraints  to  be  iniposed  on  their  design. 
Then,  the  global  objective  function  and  constraints  must  be  expressed 
in  terms  of  those  associated  with  the  layer  subnets.  As  an  example, 
assuming  that  messages  travelling  over  k  layer  subnets  will  incur 
the  same  delay,  Tj^,  then  the  problem  is  to  express  the  global  average 
delay,  T,  in  terms  of  i.e.,  an  expression  of  the  form 


m 


i=l 


a.T. 
1  1 


must  be  derived.  A  particular  case  is  solved  in  Section  6.7  when 
an  optimal  NHC  structure  is  considered.  Similarly,  given  a  certain 
dollar  budget,  the  question  may  arise  as  to  how  much  one  should  allo¬ 
cate  for  the  design  of  each  of  the  layer  subnets  at  any  level.  Section 
6.7  will  address  some  of  the  above  questions. 


6.3.3  Design  and  Evaluation  Steps 

These  two  steps  will  not  be  studied  in  depth  in  this  report. 
However,  the  economies  of  hierarchical  networks,  as  well  as  their 
operational  flexibility  (routing,  flow  control,  addition  of  nodes), 
will  be  pointed  out  in  Chapter  7.  Since  the  MIT  results  in  the  design 
of  a  set  of  much  smaller  subproblems,  then  present  heuristic  design 
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methods  [GERL  73A] ,  [FRAN  72]  become  feasible  and  quite  attractive  to 
perform  the  required  job. 


In  order  to  evaluate  the  computational  complexity  in  the 
design  phase  of  the  KWT,  we  will  make  the  following  assumption: 
Computational  Cost  Assumption  6.3 

a.  The  computational  cost  incurred  in  the  design  of  a 

th  “k 

layer  subnet  connecting  a  set  of  n  k  level  gates  is  equal  to  n 
(k  =  1 ,  2 ,  ....  m) . 

b.  The  total  computational  cost  involved  in  the  design  is 
equal  to  the  sum  of  the  costs  induced  in  the  design  of  all  the  layer 
subnets . 

Recall  that  the  polynomial  form  of  the  computational  cost  is 
the  one  normally  used  [FRAN  72]  to  characterize  the  computational 
complexity  of  most  of  the  present  design  algorithms.  The  fact  that 
different  exponents,  could  be  selected,  depending  on  the  level 

of  the  hierarchy,  is  provided  to  allow  the  modelling  of  the  design  of 
hierarchical  networks  where  different  technologies  or  design  algorithms 
or  both  are  considered  at  each  level  or  group  of  levels. 

In  summary,  the  main  elements  involved  in  the  computational 
complexity  of  the  ^t^T  have  been  exposed  under  a  set  of  assumptions 
which,  hopefully,  retain  the  essential  character  of  the  class  of  hier¬ 
archical  design  procedures. 

6.4  Optimal  Clustering  Structure,  General  Case 

Given  the  number  of  levels,  m,  the  objective  here  is  to  deter¬ 
mine  the  optimal  clustering  structure,  i.e.,  the  degree  vector  n. 
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which  will  result  in  a  minimum  computational  cost.  A  similar  problem 
has  been  solved  in  Chapter  2  but  with  a  different  objective  function, 
i.e.,  the  length  of  the  routing  table  (see  Eq.  2.3)). 


6.4.1  Expression  of  the  Computational  Cost  and  Problem  Statement 

The  model  aimed  at  capturing  the  computational  complexity  of 
the  MHT  has  been  introduced  and  discussed  in  the  previous  paragraph. 

Let  G(m,  n,  a,  8)  represent  the  computational  cost  incurred  in 
the  design  step  of  the  NUT.  It  is  a  function  of  the  underlying  clus¬ 
tering  structure,  the  number  of  gates  and  the  computational  cost  of 
the  design  of  the  different  layer  subnets  which  are  respectively  char¬ 
acterized  by  (m,  n) ,  B  and  a.  Some  of  these  variables  will  be  dropped 
from  the  above  notation  when  no  confusion  is  possible.  Notation  such  as 
G,  G(m,  n),  etc.,  will  be  encountered  in  the  rest  of  the  chapter. 
Similarly,  n,  a,  6)  will  denote  the  cost  incurred  in  the  design 

of  all  layer  subnets.  We  are  now  ready  to  evaluate  G;  we  start 
with  the  sin;)le  case  of  a  3- level  clustering. 


6.4. 1.1  Expression  of  G  for  a  3-Level  Hierarchy 

Fig.  6.1  shows  a  3-level  hierarchical  clustering  of  a  set  of 

nodes.  From  Assumptions  6.2  and  6.3  and  Eq.  (6.5),  the  computational 

s  t 

cost  incurred  in  the  design  of  all  the  1  layer  subnets  is 


Gj(3,  n,  a,  8) 


"3 

L 

^3*1 


a. 


"2=1 


[8jni(i3.  i2)] 
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where  the  summations  extend  over  all  1  level  clusters  of  the  network 
(Recall  that  =  1) •  Similarly 


n,  a,  B)  = 


a. 


and 


0^(3,  n,  a,  B)  = 


Consequently,  the  global  computational  cost  is 

”3  a 

G(3,  n,  a,  B)  =  Y,  t^l"lf^3*  ^2^^  ^ 


3  a  a- 

*  L  [62n2(i3)]  *  [33113] 

^3=1 


(6.7) 


6.4. 1.2  Expression  of  G  for  an  m-Level  Hierarchy 

From  Assumption  6.3  and  Eq.  (6.5),  the  computational  cost  of 
the  design  of  a  layer  subnet  is  equal  to 

\  .  “k 

^*k^^m’  Vl’  ^k+1^^  f^k"k^^m’  ^m-1'  ^k+1^^ 

th 

The  above  expression,  summed  over  all  the  k’^  level  clusters,  repre¬ 
sents  the  computational  cost  incurred  in  the  design  of  all  the  k^^ 
layer  subnets,  i.e., 

"m  Vl'V  Vl'\-  Vl . ‘k.2>  a. 

(^(m.  n,  a,  B)  .  E  .  r  •"  L  [Sk-itV  Vl . Vl’l 

V'  vr‘  ‘k.r* 


a 


G,(«.  n,  6)  -  CS„n^) 


(6.8) 
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Notice  that  for  k  =  m,  by  convention,  we  will  set  the  multiple  summation 


to  one. 


From  Assiimption  6.3,  the  global  coii^Jutational  cost  is: 

m 

G(m,  n,  a,  6)  =  ^  GuCm*  n,  a,  6)  (6.9) 

k=l  ~  ~  ~ 


6.4. 1.3  Problem  Statement 


N,  a,  6 

G(m,  n,  a,  8)  [see  Eq.  (6.9)] 
m  and  n 

size  constraint  [see  Eq.  (2.1)]  (6.10) 

gate  constraint  [see  Eq.  (6.6)] 

m  positive  integer  variable 
n  vector  of  positive  integer  variables 


The  precise  problem  expressed  above  is  a  non-linear  integer  progranming 
problem  which  has  not  been  solved  in  its  entirety.  In  order  to  make 
progress  toward  a  solution,  we  choose  to  relax  some  of  the  constraints, 
namely,  the  integer  and  gate  constraints.  It  is  also  necessary  to  tem¬ 
porarily  freeze  the  variable  m.  The  relaxation  of  the  gate  constraint 
is  not  of  too  much  consequence,  since  it  will  be  shown  that  for  prac¬ 
tical  values  of  8,  a,  N,  m,  the  optimal  solution  will  satisfy  that 
constraint.  The  added  relaxation  of  the  integer  constraint  will  lead 
to  a  nice  analytic  solution  when  m  is  given;  (for  some  particular 
properties  of  the  vectors,  a  and  8,  an  analytical  solution  is  found  for 
the  optimal  m.)  The  resulting  real-valued  solution  is  of  considerable 
importance  for  at  least  two  reasons:  (1)  The  study  of  its  behavior. 


with  respect  to  the  variables  m,  a,  B,  will  provide  us  with  insight  as 
to  how  much  computational  gains  can  be  obtained  through  the  application 
of  the  ^f^T  design  procedure  and  into  the  choice  of  the  appropriate 
clustering  structure.  (2)  A  sub-optimal  integer  solution  could  be 
directly  obtained  from  the  real -valued  solution  (thereby  providing  an 
vqpper  bound  on  the  cost) .  As  a  consequence  of  the  relaxation  of  the 
integer  constraint,  one  question  arises  as  to  what  is  the  meaning  of  a 
discrete  sui.jnation  where  the  upper  variable  is  not  an  integer.  Con¬ 
sider  as  an  example 

"3  a- 

G  (3.  n.  a.  6)  =  E 
i,=  l 


The  summation  is  only  meaningful  in  two  instances: 

(1)  when  n^  is  integer 

(2)  when  02 (i^)  =1^2  ^3 

Then  the  summation  becomes: 

"3  a  a 

E  t^2"2^^3^^  ■  "3 ^^2^2^ 

The  solution  of  the  optimization  problem  will  show  that  clusters  at  the 
same  level  must  be  of  equal  degree  (size);  hence,  the  2"^  condition  will 


always  be  satisfied  a  posteriori. 

The  optimization  problem  to  be  solved  is  reduced  to: 


given: 
minimize: 
over: 
s.t. : 


N,  a,  6 
G(m,  n,  a,  B) 
n,  m 

size  constraint 
n  ^  0  real -valued  vector 

m  >  0  integer 


(6.11) 
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6.4.2 


itimal  Solution 


Proposition  6.1 

Given  m,  the  number  of  levels  in  the  hierarchy,  and  assuming 
that  >  1  for  all  i  =  1,  ...»  m  ,  then  the  solution  of  the  optimiza¬ 
tion  problem  6.11,  i.e.,  the  optimal  clustering  structure,  is  such  that 
(a)  All  clusters  at  the  same  level,  k  =  1,  ...,  m,  are 
composed  of  an  equal  number  of  lower  level  clusters;  i.e.,  all  nodes 
at  the  same  level  in  the  tree  representation  are  of  equal  degree. 

The  optimal  degree  vector  reduces  to  an  m-dimensional  vector, 
n  =  (nj,n2,  ...,  n^)  whose  conponents  are  the  solution  of  the  following 
set  of  difference  equations: 

"k^^m*  ^m-1*  ^k+P  "  "k  ^  k  =  1,  ...,  m  and  ..., 


1  ~  n^n,  . . .  n^ 
2  3  m 


k-1  n 

n  (a.  -  1)  o  i=l 

i*l  ‘  i  T  — 

- 5“  dt  .  n  "i  k 

,“k  k  Li=k.i  j 


n  ct. 

i=l  ID,.., 


(6.12) 


where,  by  convention  fj  =  i  J  solution  of: 

i=nH-l  ^ 


Dz  =  1 


(6.13) 


“k  “  “k-i“k-i  *  n  (“i  -  1)  k  i  5 
1=1 


Also,  B,  is  the  solution  of 
’  k 


(b)  With  this  optimum  solution,  the  minimum  computational 

cost  is: 

G(m,  a,  B)  =  (6.15) 

The  proof  of  Proposition  6.1  is  given  in  Appendix  C.  It 
consists  of  showing  when  all  other  variables  are  fixed,  that  the 
nj(*)'s  must  all  be  equal  (to  n^) .  TTien,  n^  is  replaced  by  its  optimal 
expression  in  the  objective  function,  and  the  same  operation  is  repeated 
for  the  degrees  at  the  next  level,  and  so  forth,  until  all  levels  are 
exhausted. 

As  a  consequence  of  Proposition  6.1,  the  optimal  size  vector 
also  reduces  to  an  m-dimensional  vector,  g  =  (gj^,  g^,  ...,  g^) ,  whose 
components  are  given  by 

k=l,...,m  (6.16) 

where  nj^  is  the  solution  of  Eq.  (6.12). 

The  coii5)utation  of  the  variables  Dj^,  Bj^,  nj^  may  proceed 
sequentially  by  computing  the  sequences  below. 


where 


(1)  D_,  D. ,  D 


m+1 


(2, 

'  ’  “s’  “4'  Vl 


(3)  n„,  n 


m  m-l 


,,  n, 


m 


n 


m 


m-1 

■  n  coi  -  1)  B 
1=1 _ _  _m 

a  D 
.  m  m 

a  (6  ) 


m+1 


(6.17) 


n  is  the  first  element  to  compute  in  the  sequence  (n,  } 
m  K 

Notice  that  one  can  also  solve  the  difference  equations, 
(6.13)  and  (6.14)  to  obtain  explicit  expressions  for  Dj^  and 


Explicit  Expression  of  Dj^ 

By  convention,  when  the  lower  index  in  the  product  exceeds  the  upper 
index,  then  the  product  is  set  to  be  equal  to  one.  If  the  same  situa¬ 
tion  arises  with  a  summation  sign,  then  the  sum  is  set  to  be  equal  to 
zero.  This  convention  will  apply  throughout  the  chapter. 


Explicit  Expression  for 

k-1 


j-1 


• 

r 

1 

L. 

1 

p 

1 

D.  k-1 

"  n 

-a.D. 
n.  J  J 

n  (a.  -  1) 

i=l 

i=l 

L\J 

II 

II 

(JL . 

J 

a. 

(6.)  J 

n  a. 

i=j+l^ 


(N) 


k-1 


V-k  >  2  (6.19) 
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The  proof  follows  inunediately  from  Eq.  (6.14). 

The  combination  of  Eqs.  (6.18)  and  (6.19),  at  k  =  m  +  1, 
will  provide  an  explicit  expression  for  the  minimum  computational  cost, 
G(m,  a,  3) .  The  next  natural  step  to  take  is  to  perform  the  optimiza¬ 
tion  of  G  with  respect  to  m.  This  may  easily  be  done  numerically  when 
m  is  restricted  to  a  certain  range  of  integer  values. 

Numerical  Examples 

Define  G  =  (G^^,  G^,  G^) .  Recall  that  Gj^  is  the  computa¬ 

tional  cost  for  the  design  of  the  k^*^  layer  subnets. 

a  =  (2,  2.5,  3,  3.5) 

(i)  B  =  (1,  1,  1,  1) 

m  =  1  n  =  (10^)  i.e.,  N  =  10^ 


m  =  2 


G  =  10 


n  =  (25.08 


A 

(G  =  N  for  m  =  1) 


39.86  )  Recall  that  n 

is  assumed  to 
be  continuous. 


G  =  (2.51  lo'^,  1.00  lo'^) 


3.51  10 


m  =  3  n  =  (8.65  ,  9.63  ,  12.00  ) 

G  =  (8.646  10^,  3.453  10^,  1.728  10^) 
G  =  1.38  lo'* 


m  =  4  n  =  (5.46  ,  5.22  ,  5.58  ,  6.29  ) 

G  =  (54.61  10^,  21.85  10^,  10.92  10^,  6.24  10^) 


G  =  9.36  10' 


242 


(ii)  g  =  (1,  2,  2.  2) 


m  =  1  n  (10^) 

G  =  10^ 

m  =  2  n  =  (41.16  ,24.30  ) 

jg  «  (41.16  ,  48.60  ) 

G  =  (4.117  lo'*,  1.647  lo"*) 
G  =  5.76  10^ 


m  =  3 

n  =  (17.29,  7.65 

,  7.56 

) 

g  =  (17.29,  15.30 

,  15.12 

) 

Jg  =  (1.73  lo"^,  .69 

10^,  .34 

lo"*) 

G  =  2.77  10^ 

m  =  4 

n  =  (12.26  , 

4.83  ,  4.26  , 

3.96 

) 

g  «  (12.26  , 

9.66  ,  8.52 

7.92 

) 

G  =  (1.22  10^, 

.49  10^,  .25  10^, 

.14 

G  =  2.10  lo"* 

6.5  Optimal  Clustering  Structure  with  Uniform  Design  Strategy  and 

Gate  Assignment 

This  section  deals  with  the  special  case  where  the  same  design 
procedure  is  used  at  all  levels  of  the  hierarchy.  Also,  an  equal  number 
of  gates  is  selected  from  clusters  at  all  levels,  i.e.. 
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\  “ 


V  k  >  1, 


a  >  1 


(6.20) 


\  =  S  k  >  2,  Sj  A  1 

As  a  result,  much  simpler  expressions  are  obtained  for  n  and  G  at 
optimality,  given  m.  Furthermore,  it  is  possible,  analytically,  to 
find  the  optimum  number  of  levels  in  the  hierarchy.  At  global  opti¬ 
mality,  all  the  layer  subnets  must  be  of  equal  size.  This  surprising 
result  has  an  intuitive  explanation  given  below. 

In  what  follows,  first,  the  expressions  for  n  and  G  will  be 
derived  at  optimality,  given  m.  Then,  the  global  optimum  solution 
will  be  determined  and  its  peculiar  properties  will  be  studied,  as  well 
as  its  behavior  with  respect  to  a  and  3.  Finally,  comparisons  with 
intuitive  fecisible  solutions  will  be  made. 
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With  this  solution,  the  minimum  computational  cost  is: 


G(m,  a,  6)  = 


where  Dj^  =  -  (a  -  l)*^'^  for  k  >  1  (6.23) 

Proof 

To  prove  Corollary  6.1,  we  need  to  find  the  expression  for 
□k  and  Bk/Dk- 

Expression  for  Dj^ 

From  Eqs.  (6.18)  and  (6.20) 

k  V  •  -5 

K  =  Z  0.  hoi  -  1)^  (6.24) 

j=2 

No.  Dk  -  (a  -  "e 

After  summing  on  i,  we  find  Eq.  (6.23). 

Expression  for 

From  Eq.s  (6.19)  and  (6.20) 


k-3  . 


k-i-2,  ,,i  y  k-1,  ,,i-l 
-Li«  («-l)  I- a  Ca-l)  v._i 

1)1=1  gi=2  jja 


To  evaluate  the  exponents  involved  in  the  above  expression, 
we  need  the  following  identity. 

^The  same  result  as  in  Eq.  (6.21)  has  been  derived  in  [FRAN  72]  for  m 
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for  X  1 


(6.25) 


^  =  x(nx  -  (n  l)x  ■*■  1) 


i=l  (X  -  1) 

From  Eqs.  (6.23),  (6.24)  and  (6.25): 

k-3  .  k-3 


V-  ^ir,  ^  r  i+1,-  l^k-i-2^ 

“  1-  0*  D,  .  .  =  2-  [a  -  a  (a  -  1)  ] 

i=0  i=0 


=  (k  -  2)a^  ^  -  a(a  -  1)D, 


k-1  • 


Also 


^^“2  k-i-2  i 

Z  ia  "  ^(a  -  1)" 
i=l 


k-2  v" 
a  2^  1 

i=l 

(a  -  l)^aD. 


-  (k 


k-1 


2)  (a  -  1) 


k-2' 


Finally,  the  exponent  of  6  is  equal  to  a(a  -  l)Dj^  After  substi¬ 
tuting  these  exponents  into  the  above  expression  of  and  grouping 

terms  together,  we  get: 


-,a(a-l)D, 


a 


a  -  1 


k-1 


(g  -  1) 


(g-i) 


k-1,' 


k-2 


a 

a 


k-1 


.k-1 


,g 


(6.26) 


From  the  above  equation  at  k  =  m  +  1  and  Eq.(6.15)  we  o’^tain 
Eq.  (6.22). 

The  proof  of  Eq.  (6.21)  will  proceed  by  induction.  First, 
let  us  observe  that  from  Eqs.  (6.12),  (6.20)  and  (6.26) 


D 

..k-ll 


k 


fa-ll 


k  11^-^ 

k-n  p 


la 


k-1 


D. 


r 


r 


,a(a-l)D,_-r 


Factoring  out  the  terms  in  a,  a  -  1,  B  and  using  Eq.  (6.23)  to  evaluate 
their  exponents  (to  be  denoted  by  exp  (  )),  we  find 


exp  (B)  =  a(a-l)Dj^_j  -  aDj^ 
exp  (a)  =  a(a-l)Dj^_j  ~  \  ~ 

exp  (a-1)  =  (k-l)Dj^  -  a(a-l)Dj^_j  +  (k-2)  (a-1)^’^ 


k-1 


-a 


1,  k-1 

-  ka  +  D, 


k+1 


u  k-1 
ka  -  D, 


k+1 


After  substitution,  we  get 


.k-1 


k-1 


a 


n,  = 


a 


a  -  1 


k+1 


m 

n 

i=k+l 


"k+1 


k  >  2 


(6.27) 


For  k  =  m,  the  product  term  in  Eq.  (6.27)  vanishes  to  one,  and  the 
resulting  expression  of  n^  satisfies  Eq.  (6.21).  Assuming  that  Eq. 
(6.21)  is  true  for  i=m,  m-  1,  ...,  k+  1,  let  us  show  that  it  is 
true  for  i  =  k. 

From  the  induction  hypothesis  and  Eq.  (6.24), 


m 


n 

i=k+l 


n. 

1 


,  ,  ra-k  ,  ,  m 

(^)  (^) 


N 

B 


ro+1 


(6.28) 


Substituting  Eq.  (6.28)  into  Eq.  (6.27),  and  using  Eq.  (6.23)  to 
simplify  the  resulting  exponents,  we  arrive  at  the  expression  of  nj^  as 
given  in  Eq.  (6.21).  Since  Eq.  (6.27)  is  only  true  for  k  =  2,  3,  ...,  m 
then  it  remains  to  be  proven  that  n^  satisfies  Eq.  (6.21).  This  is 
obvious  from  the  expression  of  n^  in  Eq.  (6.12)  and  from  Eq.  (6.28)  for 
k  =  1. 
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/ 


Remarks 


(i)  Size  vector:  From  Eqs.  (6.16)  and  (6.20) 

^  "l*  ®k  ^  k  =  2,  3,  ....  m 

Substituting  Eq.  (6.21)  into  the  above  expression,  we  get  a  unique 
expression  for  gj^,  valid  for  k  =  1,  2,  ...,  m. 

(ii)  Gate  Constraint:  To  satisfy  the  gate  constraint, 

Eq.  (6.6),  the  vector  n  must  be  such  that 

n^^B.  Uj^^l  k  =  2,  ...,m 


This  condition  will  always  be  satisfied  if 

>m 


/a  -  1  V  N  ^  . 

(-j;— ) 


1  N 


In 


a 

a-1 


In  the  next  section,  we  will  show  that  the  region  of  interest  for  m 
will  effectively  correspond  to  the  above  condition. 


6.5.2  Global  Optimum 

So  far,  we  have  solved  for  the  optimal  clustering  structure 
when  m,  the  number  of  levels,  is  fixed.  We  now  intend  to  let  m  vary 
and,  consequently,  to  solve  for  the  global  optimum. 

Proposition  6.2 

Under  the  condition  of  Eq.  (6.20)  and  m  being  a  real  variabl 
the  global  optimum^  clustering  structure  is  achieved  for  a  number  of 
levels 


^"Global  optimum"  is  used  to  distinguish  the  optimum  solution  with 
respect  to  m.  Also,  *  indicates  values  at  global  optimality. 


and  a  degree  vector  n* 


n*  =  3  — ii — 

1  a  -  1 


/ 

I  *  ct 

V  n*  =  - 


(6.30) 


k  a  -  1 


k  ^  2f  3j  a  • .  ^ 


The  corresponding  minimum  computational  cost  is 

G*(a.  6)  =  (I  -  i)b“ - 2L__ 

' ^  '  (a  -  1)“'^ 


(6.31) 


Proof: 


Since  Problem  6.11  has  been  solved  when  m  is  fixed,  there  only 


remains  to  minimize  G(m,  a,  B)  over  m.  Eq.  (6.22)  may  be  rewritten  as 


G=  D^j[H(m)3 

where  H(m)  represents  the  expression  between  the  outside  brackets. 
Differentiating  G  with  respect  to  m. 


c  =  #=  D-  ,  .  D, 


m+1  m+1 


'in  H( 
D 

IIH 


„  rinHWV  Vi  I*"  I'" 

"•>[  “-iJ 


which  we  define  as 


From  Eq.  (6.22) 


In  H(m)  *  a(a  -  1)D  In  3—”  -  (m  -  Da*"  In  a 

m  ct  •  1 


♦  (m  -  l)(a  -  l)"*'^  In  (a  -  1)  +  a"*  In  N 


Differentiating  Ir  H(in)  with  respect  to  m,  and  substituting  into  U,  we 


arrive  at 


a(a  -  l)(ln  B  \  [D^iD*  -  D  D'  ,] 

•  a  -  1  /  '■  m+1  m  m  m+l-* 


(m  -  l)a  (In  a) In  a  -  D;^^] 


+  (m  -  l)(a  -  1)  (In  (a'l))[D^^^  In  (a 


*  “"tl"  N)  1"  “  -  “;.ll 


+  In  (01  -  1)  -  a”'  In  a] 


D'  J 

m+l-' 


Also,  from  Eq.  (6.23) 


=  a"*"^  In  a  -  (a  -  1)”*"^  ln(a  -  1) 


=  a™  In  a  -  (a  -  I)"*  In  (a  -  1) 


(6.32) 


Evaluating  the  terms  between  brackets  in  U,  we  obtain 


-  D  D' 

m+l  m  m  ra+1 


D  ,  In  a  -  D' 
m+l  m+l 


D  In  (a  -  1)  -  D'  , 
m+l  •'  m+l 


a-‘(a-  1)"-'  in  34^ 


-Ca  -  1)"  In 
^  ^  a  -  1 


m  ,  a 


Replacing  those  expressions  in  ,  we  notice  that  the  last 

term  of  U  will  cancel  with  D'  ,,  and  we  are  left  with 

m+l’ 


dG  ^m+1  m.  ..m/,  a  \  .  B  /  a 

"n 


Since  a  >  1,  G'  is  of  the  same  sign  and  has  the  same  roots  as 


(ii)  When  all  the  k  level  clusters  are  of  equal  degree,  the 


computational  cost,  Eq.  (6.9),  becomes 

m  m 

G(in,  n.  a,  6)  =  /  11 

k=l  H=k+1  / 

Substituting  Eq.  (6.30)  into  Eq.  (6.36),  we  find 


“k 


(6.36) 


.m. 


G*  =  Cn*) 


^  (n*/8)'"*  -  1 


Replacing  n*  in  Eq.  (6.37)  by  3 


1  (n*/6)  -  1 

a 


(6.37) 


1  ‘  '  '  '  a  -  1 

operation  terminates  the  proof  of  Proposition  6.2 


,  we  get  Eq.  (6.31).  This  last 


Remarks 

1.  Size  vector;  At  global  optimality,  the  size  vector  is 

such  that 

g*  =  6  k  =  1,  2,  ...,  m^  (6.38) 

Eq.  (6.38)  indicates  that  at  optimality,  all  the  layer  subnets  are  of 
equal  size  which  depends  only  on  a  and  0.  The  explanation  of  this 
very  simple  and  interesting  property  will  be  the  object  of  Section 
6.5. 3. 

2.  Gate  constraint:  From  Eq.  (6.38),  since  a  >  1, 

gjj  _>0  ksl,2,  ...,m 

Hence,  the  gate  constraint,  Eq.  (6.6),  is  always  satisfied  at  opti¬ 
mality.  Moreover,  as  noticed  at  the  end  of  Section  6.5.1,  the  optimal 
solution,  given  m  £  m,,,  will  also  satisfy  the  gate  constraint. 

3.  Number  of  layer  subnets:  Eq,  (6.31)  may  be  rewritten  as 


(6.39) 


G.  =  (^- l)ca-  l)(e^) 

-  ^ 

From  Eq.  (6.38),  we  conclude  that  the  number  of  layer  subnets  is 

NL*  =  -  i)(a  -  1)  (6.40) 

The  above  results  could  also  be  derived  by  counting  the  number  of 
clusters  in  the  tree  structure  (Fig.  2.4).  Let  NCj^  be  the  number  of 
level  clusters;  then,  from  Eq.  (6.12) 

m 

NC.  =  n  n.  k  =  1,  2,  ...,  m  (6.41) 

i=k+l 

At  optimality,  given  m,  NCj^  is  given  by  Eq.  (6.28);  and  at  global 
optimality 

-  (jrrr) 

Hence,  the  total  number  of  layer  subnets  is 

NL.  =  NC*  .  Co  -  1)((s4t)  -  V 

The  substitution  of  Eq.  (6.34)  into  the  above  equation  gives  Eq.  (6.40) 

4.  Cost  distribution:  The  con^utational  cost  incurred  in  the 
design  of  all  the  k*^  layer  subnets  is 

=  NC*(b  k=l,  ...,m*  (6.43) 

which  is  to  be  compared  with  the  total  cost  G,,.  For  practical 
situations,  N/6  »  1;  hence,  from  Eq.  (6.31) 
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N3 


a-1 


a“  N  /„  g 

“  nj  Qt  -  1  ) 


(a  -  1) 


Since  NC*  =  N/n*,  then  substituting  Eq.  (6.43)  for  k  =  1  into  the 
above  equation,  we  get 


G,,  -  aG*  (6.44) 

St 

which  says  that  the  design  of  the  1  layer  subnets  represents 
approximately  1/a  of  the  total  computational  cost. 

5.  Behavior  of  the  optimal  tree  structure  with  respect  to  a: 
We  will  assume  that  3=1.  Let  us  define  a  regular  tree  of  degree  K 
as  a  tree  whose  nodes  are  all  of  equal  downward  degree  K,  except  for 
the  leaves  (downward  degree  zero),  [KNUT  69].  As  an  example,  a  binary 
tree  is  a  regular  tree  with  degree  2. 

The  global  optimum  solution  given  in  Eq.  (6.30)  becomes,  for 

3  =  1* 

n*  =  k  =  1,  2,  ...,  m*  (6.45) 


We  are  interested  in  the  set  of  a's  which  yield  to  integer  solutions, 
i.e.,  to  regular  tree  structures.  If  K  is  the  degree  of  such  trees, 
then  a  must  be  such  that 


a 


a  -  1 


K 


<^> 


(6.46) 


Moreover,  if  a  regular  tree  of  degree  K  is  composed  of  m*  levels,  then 
it  contains  k"**  leaves.  Consequently,  there  is  a  one  to  one  corres¬ 
pondence  between  the  set  of  regular  trees  of  degree  K  (K  ^  2  integer) 

whose  number  of  levels  is  m*  (integer  ^  2)  and  the  global  optimal 

K  in 

solutions  of  Problem  6.11,  where  a  =  r: - ^  and  N  =  K  *  for 
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K  =  2,  3,  Notice  that  the  above  set  of  a's  is  contained  in  the 

interval  (1,  2]  of  real  values,  i.e.,  1  <  a  £  2.  Also,  a_=_^  corres¬ 
ponds  to  a  binary  tree  representation. 

6.5.3  Irreducibility 

The  sinplicity  of  the  solution,  Eq.  (6.38),  obtained  at  global 
optimality  leads  us  to  consider  a  more  intuitive  approach  toward  its 
derivation,  based  on  irreducibility  considerations. 

Definition;  An  irreducible  set  of  nodes  is  such  that  no  coiq>utational 
gain  can  be  obtained  through  the  application  of  the  ^tiT  for  the  design 
of  the  corresponding  communication  network. 

Lemma  6.1;  At  global  optimality,  each  layer  subnet  to  be  designed  must 
correspond  to  an  irreducible  set  of  nodes. 

Proof  (by  contradiction) ;  Assume  that  at  optimality,  there  exists  at 
least  one  reducible  unit.  Then,  the  application  of  the  MIT  to  design 
the  network  corresponding  to  that  specific  unit  will  reduce  the  com¬ 
putational  cost  for  that  unit  and  consequently  for  the  total  design. 

This  contradicts  the  fact  that  we  have  reached  the  global  optimum. 

Notice  that  the  application  of  the  MIT  to  a  reducible  imit  will 
preserve  the  hierarchical  structure  of  the  network  as  defined  in  Section 
6.3.  This  would  not  be  true  if  the  a's  or  B's,  or  both,  were  not  equal 
at  all  levels. 

Assumption;  Our  intuition  leads  us  to  assume  that  for  a  given  a  and 
3,  the  size  of  an  irreducible  unit  is  unique. 
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Fact  6.1:  At  optimality,  all  layer  subnets  must  be  of  equal  size,  q. 


and  the  number  of  levels  must  be  equal  to  m,^. 


q  =  e 


a 


a 


and 


1  N 
In  ^ 


In 


a 


(6.47) 


a  -  1 


Proof;  As  a  consequence  of  Lemma  6.1  and  the  above  assumption,  at 
optimality  all  the  layer  subnets  are  of  equal  size.  Let  q  be  that 
size;  then  from  Eq.  (6.5) 


(6.48) 


.  n,  =  q/6  k  =  2,  . . . ,  m 


From  the  size  constraint,  Eq.  (2.1) 


(6.49) 


Substituting  Eqs.  (6.48)  and  (6.49)  into  Eq.  (6.9),  we  arrive  at 

,a/m 


G=(f  -  l)  6“ 


(N/g)’ 


(N/6)^^"'  -  1 


(6.50) 


In  order  to  find  the  optimal  number  of  levels,  there  remains  to  mini¬ 
mize  G  with  respect  to  m.  Differentiating  G  with  respect  to  m,  we  get 


(6.51) 


In  practical  situations,  N/g  >  1.  Under  this  assumption,  dG/dm  is  of 
the  same  sign  and  has  the  same  roots  as  a  -  (a  -  1)  (N/g)  Equating 
this  last  term  to  zero,  we  get  m  =  m*,  as  given  in  Fact  6.1. 


From  the  sign  of  dG/dm,  we  note  that  G  is  minimum  for  m  =  m*.  This 
terminates  the  proof  of  Fact  6.1. 


Conclusion:  The  assumption  introduced  in  this  section  is  intuitively 
justified  a  posteriori,  since  the  solution  of 


In 


1 


with  respect  to  N  is  unique  and  equal  to  =  Ba/(a  -1).  In  other 
words,  the  optimal  number  of  levels  for  a  design  of  a  network  with 
Ba/(a  -  1)  nodes  is  equal  to  one. 


6.5.4  Variations  and  Limiting  Behavior  of  the  Optimal  Solution  with 

Respect  to  the  Design  Variables 

The  behavior  of  the  computational  cost  and  the  degree  vector 
at  optimality,  given  m,  will  be  studied  with  respect  to  m.  Of  inpor- 
tance  is  the  fact  that  £(m)  converges  fairly  fast  to  its  minimum  value, 
versus  m,  and  remains  very  close  to  that  value  as  m  grows  to  infinity. 

A  similar  phenomenon  will  characterize  the  behavior  of  the  Oj^'s  versus 
m.  We  will  also  notice  that,  as  a  goes  to  infinity,  the  optimal 
solution,  given  m,  is  such  that  all  layer  subnets  must  be  of  equal 
size;  such  a  property  was,  for  finite  a,  only  true  at  global  optimality, 
i.e. ,  m  =  m^. 

The  variations  of  G*  and  m*  with  respect  to  a,  B,  N,  do  not 
disclose  any  remarkable  property,  except  that  m*  rapidly  reaches  its 
asynptotic  value  as  a  becomes  greater  than  2. 

In  what  follows,  we  will  restrict  the  study  to  the  practical 
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situation  where 


e>i,  g>i 


(6.52) 


6.5.6. 1  Behavior  of  the  Optimal  Solution  and  Objective  Function, 


Given  m 


a.  £(in)  versus  m:  Section  6.5.2  showed  that  as  m  increases 
from  zero  to  infinity,  G(m)  decreases,  reaching  its  minimum  at  m  =  m,, 
(note  m*  >  0  because  of  Eq.  (6.52)  and  a  >  1),  and  then  increases.  The 
limit  of  G(m) ,  as  m  goes  to  infinity,  is: 


G  A  limit  G(m)  *  B 


m  “ 


a-1  g 
(ot  -  1)' 


(6.53) 


Proof 


Let  us  first  derive  some  intermediary  results.  From  Eq.  (6.23) 


Hence,  when  m  goes  to  infinity 


D  /D  ,  -*■  - 

m  m+ 1  g 


m(g  -  1] 


(6.54) 


ra+1 


-(mg™)/D^ 


Consequently,  substituting  these  limits  into  Eq.  (6.35),  we  arrive  at 
Eq.  (6.53).  Notice  that  the  difference 


,g  g 


G  -  G*  =  B  - = - r 

"  (a  -  1)“-* 


2S8 


is  independent  of  N,  and  that  the  relative  difference 


G*  N  -  e 


goes  to  zero  as  N  becomes  large. 

In  what  follows,  four  sets  of  figures  (Figs.  6.2  -  6.9),  each 
corresponding  to  a  specific  value  of  the  pair  (a,  3),  {(3,1),  (3,3) 
(5,1)  (5,3)},  will  be  shown.  In  each  set,  the  functions  ^(m) 
and  ^(m)/G*  are  plotted  with  respect  to  m  and  for  several  values  of 
N,  N  =  {50,  10^,  10^,  10^,  10^,  10^}.  The  curves,  £(m)  versus  m, 
illustrate  the  initially  decreasing,  then  slightly  increasing  and 
asyn^Jtotic  behavior  of  the  optimal  computational  cost  for  a  fixed  m. 

By  comparing  G(m)  to  N  ,  we  are  able  to  appreciate  the  enormous  com¬ 
putational  gains  obtained  through  the  application  of  the  ^t^T. 

The  curves,  G(m)/G*  versus  m,  illustrate  the  "locking"  effect 
whereby,  once  G(m)  reaches  its  minimum  value  of  m  =  m*,  Eq.  (6.29),  it 
will  appear  as  if  it  remains  indefinitely  at  that  value.  They  also 
illustrate  the  fairly  fast  convergence  of  £(m) toward  a  value  close  to 
the  minimum,  for  a  value  of  m  relatively  smaller  than  m*.  This  indi¬ 
cates  that  we  may  actually  obtain  most  of  the  computational  gains  with 
hierarchical  structures  whose  number  of  levels  (m)  is  much  smaller 
than  the  optimal  ones  (m*) .  Finally,  the  four  sets  are  present  in 
order  to  indicate  the  effect  of  the  design  parameters  a  and  6  on  the 
behavior  of  the  aforementioned  functions. 

b.  n,  versus  m 
k _ 

Differentiating  Eq.  (6.21)  with  respect  to  m,  we  find,  after 


simplification 


(in  --  — V- 
\  a  -  1 


dn,  ,  ,Nin-k  m+k-1 

k  _  (ot  -  1)  a 

dm  '  "k  ^2 


•)(^) 


D 


1  1  N  ,  a 

1  -  In  75-  +  m  In  - r 

3  a  -  1 


m+1 


For  a  >  1,  dn. /dm  is  of  the  same  sign  as  the  expression  in  brackets  (to 
K 

be  denoted  by  Z) .  From  Fig.  6.10,  we  notice  that  Z  has  a  unique  root. 


m^,  which  is  such  that 


m*  <  m„  <  m*  +  - 

In - r 

a  -  1 

Also,  for  m  <  m^,  Z  is  negative  and,  conversely,  for  m  >  m^,  Z  is 

positive.  Z  is  equal  to  zero  at  a  value  of  m  ,  m^,  which  is  independent 

of  k.  Consequently,  as  m  varies  from  zero  to  infinity 

(i)  all  Hj^'s,  k  <  m^,  decrease,  reach  a  minimum  and  then 
increase  toward  their  limits. 


(ii)  all  k  >  iiIq,  increase  toward  their  limits. 

The  limit  of  for  k  fixed,  as  m  goes  to  infinity,  is 


limit  n  =  8  — 

a-1 

m  -►  00 


limit  n,  = 


a 


k  a-1 


k  >  2,  k  fixed 


m  ->■  00 

The  proof  is  immediate,  after  observing  that 

.  ,^m-k  k-1 

, .  .  ^  (a-1)  a  , .  .  ^  m 

limit  m  - -  limit  — 

D  ,  a 

m  ->•  00  ni+l  m  -»■  00 


(“jii.)" 
'  a  ' 


m-k 


=  0 


(^) 


and  then,  substituting  this  limit  into  Eq.  (6.21).  If  we  let  k  vary, 
more  particularly,  if  we  let  k  =  m,  then 


limit 


ma 


m-1 


D 


=  +  00 


m  -►  00  uj+i 


Hence 


limit  n  =  0 


m 


m 


Moreover,  In  n^  has  as  an  asymptote  a  straight  line  of  equation 


m  ,  a  ,  a  /  N  \ 

-  —  In  - r  +  In  - r  (o-) 

a  a-1  a-l\8' 


1/a 


The  behavior  of  nj^  versus  m,  for  k  <.  m»,  exhibits  the  "locking"  effect 
as  previous  described.  Moreover,  this  phenemonon  is  quite  remarkable 
here,  since  the  limit  of  nj^,  as  m  goes  to  infinity,  is  equal  to  the 
value  of  nj^  at  global  optimality.  Fig.  6.11  illustrates  the  above 
properties. 
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6. 5. 4. 2  Behavior  of  G»  and  with  Respect  to  a,  B,  N 


a.  G*  versus  a;  Differentiating  Eq.  (6.31)  with  respect  to 
a,  we  find 


dG*  ^  /N 
da  ~  \  3 


Under  the  conditions  of  Eq.  (6.52)  and  a  >  1,  G*  is  an  increasing 
function  of  a.  As  a  goes  to  infinity,  G*  will  be  asymptotic  to  the 
expression,  (N/6  -  l)e(a  -  1/2) 3°*. 

If  3=1  the  asymptote  is  a  straight  line,  which  implies  that 

at  the  limit,  G^  will  show  a  linear  growth  with  a. 


b.  G*  versus  N;  Eq.  (6.31)  shows  that  G*  varies  linearly  with 
N.  It  is  also  inportant  to  notice  that  the  optimal  hierarchical  struc¬ 
ture  reduces  the  conputational  cost  from  the  order  of  steps  to  the 
order  of  N  steps! 

c.  G*  versus  3:  Differentiating  Eq.  (6.31)  with  respect  to 
3,  we  arrive  at 


ot 

3g-=- - ^6  [(o-I)N-a8]. 

(a  -  1) 

Consequently,  G*  is  an  increasing  and  then  decreasing  function,  as  8 
varies  from  1  to  N  (for  N  >  ^  and  under  the  conditions  of  Eq. 

(6.52)).  Equating  the  derivative  to  zero,  we  find 


3 


0 


(6.55) 


Substituting  Eq.  (6.55)  into  Eq.  (6.31),  we  arrive  at  the  maximum  value 
of  G„  (with  respect  to  3),  G*(3  =  3q)  =  This  maximum  cost  corre¬ 

sponds  to  a  design  with  no  hierarchical  structure.  This  last  equation 
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checks  with  the  fact  that  for  a  set  a,  6,  N,  satisfying  Eq.  (6.55),  the 
optimal  number  of  levels,  m*,  is  equal  to  one  (Eq.  (6.29)),  i.e.,  for 
such  a  set,  a  non -hierarchical  design  is  optimal. 

d.  m^  versus  N  and  g:  Recall  Eq.  (6.29), 

m^^  =  (In  N/3)/(ln  a/(a  -  1)).  Hence,  m^  varies  as  a  logarithm  of  N/6. 
This  logarithmic  behavior  appears  to  be  characteristic  of  hierarchical 
structures  [CHOW  74],  (see  Chapter  2) . 

e.  m»  versus  a:  Differentiating  Eq.  (6.29)  with  respect  to 


a,  we  find 


dm* 

da 


In  N/6 


a(a  -  1)  [In  (a/ (a  -  1))] 

Consequently,  for  a  >  1,  m^  is  an  increasing  function  of  a.  The 
limiting  values  of  m  are 


a  ->  1*  =>  I 


m*  0 


dm* 


da 


-»■  +  «> 


()(-►+  00  => 


dm* 

da 


In  N/6 


The  asymptote,  as  a  goes  to  infinity,  is  the  straight  line 


=  (a  -  j)  In  N/6 


Fig.  6.12  shows  the  plots  of  m,^  versus  a  for  several  values  of  N/6- 
Notice  that  m*  rapidly  reaches  its  asymptotic  value  as  a  becomes  greater 
than  2.  Also, 

"•*  -  1  -  N/ff  -  1 
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which  means  that  for  1  £  ot  ^  partitioning  is  required: 

furthermore,  a  loss  will  be  incurred  if  we  try  to  do  so. 


6.5.4. 3  Limiting  Behavior  of  the  Optimal  Solution,  Given  m,  with 


Consequently,  as  a  goes  to  infinity,  the  optimal  solution,  given  any  m, 
is  such  that  all  layer  subnets  are  of  equal  size.  Such  a  property  was 
found  to  be  true  for  finite  a,  only  at  global  optimality. 
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6. 5. 4. 4  Optimal  Clustering  when  a  ^  1 


Notice  that  if 


1 


then,  from  Eq.  (6.29),  we  conclude  that  m^  ^  1,  which  means  that  no 
clustering  is  needed  for  oi  satisfying  the  equation  above.  Let  us  show 
that  this  result  holds  true  if  0  <  a  <  1. 


Proposition  6.3 

Under  the  condition,  0  <  ct  ^  1,  the  global  optimum  of  Problem 
6.11  (with  equal  ot's)  is  achieved  for  m  =  1,  i.e. ,  no  clustering  is 
required. 


Proof : 


By  contradiction,  assume  that 
proof  of  Proposition  6.1  (^pendix  C), 
and  all  degrees,  nj^(i^,  ...,  (k 
reduced  problem  with  respect  to  "^(i^. 


m  >  1;  then,  similar  to  the 
let  us  fix  m  (m  >  1,  integer) 
=  2,  ...,  m) ,  and  solve  the 
...,  12)^5.  i.e.. 


NC, 


min:  G,  = 


L  [n,(i)] 
i=l  ^ 


a 


over:  nj(i) 

NCj 

s.t.:  ^  n.(i) 

i=l  ^ 


=  N 


(see  Eq.  (6.8)) 


(see  Eq.  (2.1)) 


nj(i)  ^  0  real  variable 

where  NCj  denotes  the  number  of  1  level  clusters,  and  nj(i)  denotes 
the  size  of  an  arbitrary  1^^  level  cluster.  Since  1,  then  [nj(i)]“ 
is  a  concave  function;  hence,  the  objective  function  above,  Gj,,  is  a 
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concave  function.  Also,  the  set  of  feasible  vectors,  n^  =  {nj(i)}  is 

a  bounded  convex  polyhedron  whose  vertices  correspond  to  vectors  with 

all  zeros  except  for  one  component  equal  to  N,  i.e.,  (N,  0,  0,  ...,  0) 

and  its  permutations.  As  a  result,  we  are  faced  with  the  optimization 

of  a  concave  function  over  a  bounded  convex  polyhedron,  whose  optimal 

ot 

solution  is  well  known  to  be  at  a  vertex.  At  any  vertex,  =  N  ; 
consequently,  it  is  the  minimum.  Notice  that  this  result  is  true  for 
any  NC^,  i.e.,  for  any  nj^(i^,  ...,  .  (k  =  2,  ...,  m) .  In  other 

words,  for  any  vector  n  satisfying  the  size  constraint,  Eq.  (2.1), 

Gj  ^  which,  combined  with  Eq.  (6.9),  gives 

a  "* 

G(m,  n,  a,  3)  ^  N  +5^  Gi.(in»  n,  a,  3)  ¥  n,  feasible 


Hence, 


G(m,  n,  a,  3)  >  N  ¥  n,  feasible 


Consequently,  the  optimal  solution  of  Problem  6.11,  for  integer  m  >  1, 
must  also  satisfy  the  above  inequality  which  is  a  contradiction,  since, 
if  m  =  1,  G(m,  n,  a,  3)  =  n“. 


Remark 


Proposition  6.3  holds  true  for  the  general  case  where  the 


Oj^'s  are  arbitrary,  if  <  1. 


6.5.5  Con^arison  of  the  Optimal  Solution  with  TWo  Feasible  Solutions 


The  Q-solution  models  the  situation  where  all  layer  subnets  are  of 
equal  size. 

q  =  k  =  1,  2,  . . . ,  m  (6.58) 

Notice  that  the  two  solutions  are  identical  when  3=1.  The  optimal 
solution,  given  m,  will  be  referred  to  as  the  G-solution. 


6. 5. 5.1  The  Q-Solution 

This  solution  has  been  studied  in  detail  in  the  course  of  the 
proof  of  Fact  6.1  (Section  6.5.3).  Of  significance  is  the  property 
that  the  Q-solution  satisfies  Proposition  6.2  at  global  optimality. 

The  corresponding  computational  cost  (denoted  by  Q)  is  given  by  Eq. 
(6.50).  There  remains  to  find  the  limit  of  Q  as  m  goes  to  infinity. 
From  Eq.  (6.50), 

limit  Q  s  +  oo 

m  -*•  00 

The  corresponding  asymptote  is  a  straight  line  whose  equation  is 


6. 5. 5. 2  The  R-Solution 

From  Eq.  (6.57)  and  the  size  constraint,  r  =  With  this 

assignment,  the  computational  cost  is 


Differentiating  Eq. 


R  - 


N  +  8 


a  N  -  N 


N 


1/m 


(6.60)  with  respect  to  m. 


1/m  ■ 

-  1  . 
we  find 


(6.60) 
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r' 

’ 


dR  In  N  r.a,..  ,..,l/m 


dm  2-„l/m  ,.2 

m  (N  -  1) 


[ 


3  (N  -  1)N 


-  (a  -  1)(N^/™  -  1)  (6“(N  -  +  NCN^'^"’  -  l))J  (6.61) 

If  we  let  Y  represent  the  expression  in  brackets  and  X  =  then 


Y  =  -(a  -  1)(N  -  3“)X^ 


^2(a  -  1)N  +  e“(2N  -  a(N  +  1)))  X  +  (oi  - 


1)N(3“  -  1) 

(6.62) 


dR 

is  of  the  same  sign  as  Y.  Notice  that 


X  -»•  -  «  =>  Y  •>  -  “ 

X  =  =>  m  ->  +  oo  =>  Y  =  (N  -  1)3“  >  0  (6.63) 

X->-  +  oo  =>  jn->0*  =>  Y-»--“ 


Ihus,  there  exists  a  unique  root,  Xj^,  which  is  greater  than  one.  Let 
be  such  that 


In  N 
in  X^ 


(6.64) 


From  the  above  remarks,  we  conclude  that  R  is  minimum  for  m  =  mj^. 

There  remains  to  find  the  limit  of  R  as  m  goes  to  infinity.  From  Eq. 

(6.60),  limit  R  =  +  <».  The  corresponding  asynptote  is 
m  -*■  00 

Rco  =  (N  -  1)  m  +  1  +  (a  -  2)3“j.  (6.65) 


6.5.5. 3  Comparison  of  the  Three  Solutions 

Let  us  first  summarize  the  properties  of  the  three  solutions. 
1.  All  three  solutions  possess  an  optimal  number  of  levels. 


278 


®*  (Eq.  (6.29))  for  G  and  Q,  and  lUj^  (Eq.  (6.64))  for  R. 

2.  For  large  values  of  m,  the  G-solution  is  asymptotic  to  a 
constant;  whereas,  the  Q  and  R  solutions  asymptotically  grow  linearly 
with  m.  In  other  words,  for  m  ^  G,  contrary  to  Q  and  R,  is  not 
sensitive  to  m. 

3.  The  G  and  Q  solutions  meet,  for  any  a,  at  m  =  m*  and,  for 

a  ^  ,  at  any  given  m. 

4.  The  Q  and  R  solutions  are  identical  for  3=1. 

Figs.  6.13  and  6.14  illustrate  the  above  properties.  Moreover 
we  notice  that,  as  expected,  G  is  always  smaller  than  or  equal  to  Q  and 
R,  and  that,  for  3  r  1,  Q  is  certainly  better  than  R,  for  values  of  m 
in  the  neighborhood  of  m*,  but  not  necessarily  outside. 


6.5.6  Suboptimal  Integer  Solution 

We  propose,  below,  a  heuristic  algorithm  which  generates  an 
integer  solution  from  the  optimal  real-valued  solution. 


Algorithm;  Given  N,  m,  3 

1.  M  m,  OLD  N  ■*-  N  [save  old  values] 


2.  Compute  n^  from  Eq.  (6.21) 

ndj^  fn^l  ceiling  operation 


adjust  size  constraint 


m  m  -  1 


If  m  >  1  GO  TO  2. 

m 

Else  nd-  ■*-  fwl,  ND  IT  "d- 


3.  m  -f-  M,  N  ■«-  OLD  N 


4.  DO  i  =  2,  3,  . . . ,  m 

OLD  nd.  nd. 

1  1 

"'•i  "'•ii 

(ND) (nd. ) 

VTr\  ^ 


5.  Con5)ute  corresponding  conputational  cost 

m  /  m  \ 

GD  =  £  n  nd  )  (e  (nd  )“) 
k=l  \i=k+l  V  '  ' 


The  main  idea  underlying  the  above  algorithm  is  to  always 
compute  the  degree  of  the  highest  level  cluster;  then  after  readjustment 
of  the  number  of  nodes,  N,  the  operation  is  repeated  for  an  equivalent 
problem  with  one  less  level.  This  idea  is  incorporated  in  Step  2.  The 
purpose  of  Step  4  is  to  adjust  the  integer  solution,  {nd^}  ,  obtained 
in  Step  2,  in  order  to  reduce  the  slack  in  the  size  constraint,  i.e., 
in  order  to  reduce  the  difference  between  ND  and  the  given  value  of  N. 

The  global  minimum  may  easily  be  obtained  when  m  is  restricted 
to  a  certain  range  of  values.  This  could  be  done  by  confuting  the  cost 
incurred  for  each  m  (using  the  above  algorithm)  and  then  choosing  the 
number  of  levels  which  corresponds  to  the  minimum  cost. 

The  performance  of  the  above  algorithm  is  illustrated  in  the 
following  sets  of  figures.  Figs.  6.15  -  6.18.  These  figures  must  be 
compared  to  those  corresponding  to  the  real -valued  solution.  Figs. 

6.2  -  6.9.  We  notice  that  the  plots  of  G(GD)  versus  m  roughly  exhibit 
the  same  shape  as  found  for  the  real-valued  solution.  The  two  solutions 
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are  also  fairly  close  for  small  values  of  m.  Moreover,  the  algorithm 
seems  to  work  better  in  the  smaller  region  of  a,  and  it  appears  that  it 
is  not  that  sensitive  to  6.  The  plots,  GN/G*,  show  the  initial  fast 
drop  in  computational  cost  followed  by  a  levelling  behavior  to  a  value 
within  2G^  for  a  *  3  and  SG,^  for  a  =  5,  and  which  seems  to  be  fairly 
independent  of  N. 

6.6  Application  to  Other  Special  Cases 

This  section  deals  with  the  following  two  cases: 

(i)  uniform  design  strategy,  variable  gate  assignment,  i.e., 

Qij^  =  oi  Vk=l,  ...,m  and  3j^'s  variables. 

(ii)  proportional  assignment  of  gates. 

TTiis  section  is  treated  in  Appendix  C.  A  summary  of  the 
results  is  presented  below. 

In  (i)  explicit  expressions  for  the  optimal  degree  vector  and 
computational  cost,  given  m,  have  been  derived.  Some  partial  results 
related  the  the  global  optimality  have  been  found  when  is  of  the  form 
8^  =  k  >  2. 

With  respect  to  (ii),  the  number  of  k^^  level  gates  to  be 

selected  is  proportional  to  the  number  of  k-1  level  gates  from  which 

they  are  selected.  The  corresponding  solution  was  found  to  be  of  no 

practical  interest.  The  solution  is 

I  n  =  0,  n  =  +  « 
j  1  m 

\  n_,  ...,  n_,  ,  any  value  different  from  0  or  «>, 

dim-L 

m 

such  that  n  n.  »  N. 
i=l  ^ 
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6. 7  Delay  Expression  for  Hierarchical  Networks 

Given  a  network  with  an  m-level  hierarchical  structure,  the 

problem  is  to  express  the  total  average  delay  in  the  network  in  terms 

of  the  average  delays  in  the  layer  subnets  composing  the  network.  We 

th 

will  assume  that  all  k  layer  subnets  are  of  equal  size,  i.e., 

"k^^m*  ^m-1’  •••*  ^k+1^  ''"k  V-k=l,  ...,m 

and  that  they  will  induce  the  same  average  delay,  Tj^,  over  the  traffic. 
This  last  assun^tion  will  usually  appear  as  a  design  constraint,  and  as 
such,  it  is  a  reasonable  one. 

The  traffic  in  hierarchical  networks  may  be  divided  into  m 
classes.  Class  k  traffic  is  defined  as  the  traffic  between  pairs  of 
nodes  which  belong  to  the  same  k^  level  cluster  but  not  to  any  lower 
level  clusters.  If  we  define  Tj^  as  the  average  delay  incurred  by  class 
k  traffic,  then  obviously 

m  r. 

T  =  E  “T 
k=l  '■ 

where  F,  =  total  class  k  traffic 

k 

r,  T  are  as  defined  in  Eqs.  (4.1)  and  (4.2) 

From  Flow  Assumption  6.1,  we  know  that  class  k  traffic,  when  going  from 

its  origin  to  its  destination,  has  to  go  iq)  through  k-1  layers  to  the 
‘th 

k  common  layer.  Then  it  will  go  down  k-1  layers  in  a  manner  described 
by  Fig.  6.19. 
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LEVEL 


DELAY 


DEGREE 


Figure  6.19.  Illustration  of  the  Tree  Path  of  class-k  Traffic. 


Consequently, 


k-1 


Z 

i=l 


T.  +  T. 
1  K 


k  >  1 


C6.67) 


may  also  be  evaluated  using  the  flow  assumption  but,  in  general,  it 
will  yield  too  complicated  an  expression.  If  we  assume  a  uniform 
traffic  pattern,  i.e.,  Y-i.  =  Y  -Y  (j,k),  then 

JK 


^k  =  '"t^k 


k  >  1 


(6.68) 


Proof: 

Let  Cj^  be  a  k^^  level  cluster,  and  let  ^(i)  and  Cj^  j^(j)  he 

St 

two  arbitrary  k-1  level  clusters  which  belong  to  C,  .  The  traffic 
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exchanged  between  Cj^  j(i)  and  Cj^  j(j)  is,  according  to  the  previous 

definition,  a  class  k  triffic.  The  number  of  nodes  in  a  k-1^^  level 

k-1 

cluster  is  equal  to  (see  Fig.  2.5)  H  n..  Hence,  the  volume  of  traffic 

from  to  ls,  for  any  (i,j),  \  n  n^^  Y  • 

i=l 

The  number  of  pairs  of  k-1^^  level  clusters  in  a  k^^  level 
cluster  is  -  1);  also,  the  number  of  k^^  level  clusters  is 

m 

n  n. .  Consequently,  the  total  amount  of  class  k  traffic  is 
i=k+l 


Y 


(6.69) 


From  the  size  constraint,  Eq.  (2.1),  we  know  that  11  n.  =  N.  Substi- 

i=l  ^ 

tuting  this  last  equation  into  Eq.  (6.69),  we  find  Eq.  (6.68). 

m 

Notice  that  Eq.  (6.68)  checks  with  the  fact  that  F.  =  P. 

k=l  ^ 


We  are  now  ready  to  derive  an  expression  of  T  in  terms  of  the 


T  *5 
‘k 


Proposition  6.4 

Under  the  above  assumptions,  the  total  average  delay  in  a 
hierarchical  network  is 

"  =  iHT  [("  -  "n  "k)"n,  •  II  -  '"k  • ')  n|  n,)T^] 


Proof: 


(6.70) 


Substituting  Eq.  (6.67)  into  Eq.  (6.66),  we  arrive  at 
m  .  m 


=  f  S(r|,  *  2  z 
k=r  i=kHl  ' 


(6.71) 


290 


Notice  that  Eq.  (6.71)  is  a  general  expression  of  T  in  terms  of  Tj^'s 
which  does  not  use  the  uniform  traffic  assumption.  From  Eq.  (6.68), 


m  K-i  ^ 

r  +  2  ^  r  =  Ny(2N  -  (n,  +  1)  n  n.) 

i=k+l  ^  i=l 

Substituting  Eq.  (6.72)  into  Eq.  (6.71),  we  find  Eq.  (6.70). 


(6.72) 


Numerical  example 


N  =  10  ,  n^  =  n^  =  n^  =  10, 


T  =  0.91^  +  I.82T2  +  l.99Tj  . 


B  =  1 


(6.73) 


6. 8  Conclusion 

In  this  chapter  we  studied  the  major  aspects  related  to  the 
hierarchical  design  of  large  computer  networks.  The  focus  was  primarily 
upon  the  determination  of  a  certain  clustering  structure  of  the  set  of 
nodes  to  be  used  in  the  design  phase.  Optimal  clustering  structures 
were  determined  so  as  to  minimize  the  computational  cost  required  in  the 
design  phase.  The  general  solution  (i.e.,  different  design  strategies 
and  gate  assignment  from  one  level  to  another)  was  derived  when  the 
number  of  hierarchical  levels  m  is  fixed.  The  global  optimum  solution 
was  obtained  with  the  more  uniform  case  whereby  the  same  design  strategy 
and  gate  assignment  are  used  at  all  levels.  The  global  optimum  solution 
is  such  that  all  degrees  at  all  levels  (except  the  first  level)  are 
equal  (all  layer  subnets  are  of  equal  size).  Such  a  peculiar  property 


A  similar  expression  as  in  liq.  (6.73)  i.‘  given  in  [NAC  73];  It  is 
T  =  T3  .  2T2  ^  ZTj. 
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was  found  to  have  an  intuitive  explanation.  Furthermore,  we  mention 
the  one-to-one  correspondence  of  the  Klobal  optimum  solution  for 
a  =  K/ (K  -  1)  with  some  regular  trees  of  downward  degree  K. 

Finally,  we  were  able  to  decompose  the  average  message  delay 
in  a  hierarchical  network  in  terms  of  the  average  delays  in  the  layer 
subnets  composing  the  network. 


CHAPTER  7 


CONCLUSIONS  AND  SUGGESTIONS  FOR  FUTURE  RESEARCH 

7. I  Conclusions 

Faced  with  the  prohibitive  cost  of  a  simple  extrapolation  of 
present  design  and  routing  procedures  for  large  networks,  the  goal  of 
this  dissertation  was  to  evaluate  some  new  techniques  to  be  used  in  the 
context  of  large  networks.  The  techniques  studied  here  represent  an 
extension  of  present  schemes  and  rely  mainly  on  the  natural  hierarchical 
clustering  of  the  network  nodes.  More  specifically  we  specified,  eval¬ 
uated  and  discussed  the  adaptive  m-level  Hierarchical  touting  (MIR) 
schemes  as  well  as  the  m-level  Hierarchical  Topology  (MIT)  design  of 
large  networks.  The  results  obtained  are  summarized  in  Section  1.3. 
Basically,  in  this  research  we  were  able  to  show  that: 

1)  Under  some  reasonable  cost  and  performance  constraints  for 
a  class  of  large  distributed  networks,  present  routing  schemes  become 
infeasible,  whereas  hierarchical  routing  schemes  with  optimally  chosen 
table  lengths  maintain  remarkably  good  network  performance  for  a  phe¬ 
nomenal  range  of  network  sizes. 

2)  Some  optimal  structures  were  found  to  minimize  the  com¬ 
putational  cost  involved  in  the  hierarchical  design  of  large  networks. 

3)  Some  extensions  of  existing  queueing  models  for  networks 
were  introduced  in  order  to  accomodate  nodal  storage  requirements  and 
line  overhead  due  to  routing  updates  which  become  critical  in  the  context 
of  large  nets.  Furthermore,  several  buffer  sharing  schemes  were  pro¬ 
posed  and  analyzed  in  order  to  optimally  utilize  the  nodal  storage. 
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7.2 


Future  Research 


While  there  exists  an  abundant  literature  on  issues  related  to 
the  design  of  small  and  iiraderato  sized  networks,  the  stialy  of  large 
networks  is  still  in  its  very  early  stages.  Consequently,  we  do  not 
only  inherit  unsolved  (or  poorly  solved)  problems  related  to  computer 
networks,  but  also  those  adequately  solved  need  to  be  reevaluated  in 
the  context  of  large  nets.  Of  importance  are  issues  related  to  the 
flow  control,  reliability,  security  and  distributed  data  bases.  We 
also  must  not  forget  the  long-standing  search  for  exact  solutions  of 
the  topology  design  problem  and  for  more  precise  analytic  network 
queueing  models. 

As  for  research  areas  which  emerge  directly  from  this  work, 
we  mention  the  important  issue  of  clustering,  i.e.,  assignment  of  nodes 
to  clusters,  clusters  to  superclusters,  etc.  We  briefly  addressed  that 
question  in  Chapter  5,  but  further  work  is  certainly  requi red.  Other 
clustering  techniques  (such  as  those  based  on  a  graph  theoretic  approach) 
must  be  investigated  and  experimented  with.  The  same  issue  arises  in 
the  topology  design  of  large  nets  where  altogether  different  nearness 
measures  and  clustering  techniques  may  be  required.  Moreover,  the 
hierarchical  design  procedures  should  be  compared  to  the  non-hic rarchical 
ones  on  some  theoretic  grounds,  rather  than  through  experimentation, 
using  those  algorithms  which  we  here  proved  to  be  extremely  expensive 
in  the  non-hierarchical  case.  We  conjecture  that  some  further  limiting 
results  of  interest  are  possible. 

Some  new  routing  schemes  and  design  metliods  should  also  be 


investigated. 


Filially,  it  appears  that  the  general  methodology  and  decom¬ 
position  HKidels  developed  here  for  the  study  of  large  nets,  may  be 
directly  applicable  or  extended  to  more  general  large  systems  where 
some  sort  of  decomposition  must  be  introduced  to  alleviate  the  diffi¬ 
culty  in  analysis,  design  and  evaluation.  The  identification  of  such 
systems  and  their  study  represent  a  worthwhile  investigation. 
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APPENDIX  A 


PATH  LENGTH  IN  GRID  AND  TORUS  NETWORKS 

I 

The  purpose  of  this  appendix  is  to  derive  closed  form  expres¬ 
sion  for  the  average  path  length  in  Grid- type  networks.  Also  the 

I 

'  distribution  and  the  corresponding  z-transform  of  path  lengths  in  a 

Torus  network,  are  defined  and  determined. 

I  A. 1  Definitions 

Let  (S,  A)  be  a  network  composed  of  a  set  of  nodes  S  and  a  set 
of  arcs  A.  Let  s,  t  be  any  pair  of  nodes  in  the  network  and  be 

I  any  path  between  the  two  nodes  [HARA  72],  [HU  69].  Also  let  a^  repre¬ 

sent  the  length  associated  with  arc  i,  then  the  length  of  a  path 
is  defined  as, 

'  =  D  a.  (A.l) 

The  length  of  the  shortest  path  from  s  to  t,  is 

I 

I 

all 

j  h^^  as  defined  above  is  a  distance  function  [HARA  72],  [FRAN  71] 

If  the  length  of  all  channels  is  assumed  to  be  equal  to  1,  then 
hg^  represents  the  minimum  number  of  hops  (channels)  between  s  and  t. 

j  The  average  (shortest)  path  length  in  a  network  is  defined  as 

[KLEI  64], 


I 
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"  '  "f"'-  »  s.?.S 


(A.  2) 


where  N  is  the  size  of  the  set  S.  In  some  instances  it  is  of  interest 
to  consider  the  weighted  average  path  length,  h^, 

s  ,teS 

where  is  the  weight  on  the  pair  of  nodes,  s,  t,  and 


rA  E 

s,teS 


's,t 


CA.4) 


Notice  that  if  all  the  are  equal  then  Eq.  (A. 3)  reduces  to  Eq. 

(A. 2) . 

The  diameter,  d,  of  a  network  is  defined  as  the  longest 
shortest  path  in  the  network,  i.e.. 


d  =  max  [KJ  (A.  5) 

s,teS 

In  the  rest  of  this  appendix  we  will  restrict  our  considerations  to  the 
hop  distance. 


A. 2  Average  Path  Length  of  a  Grid 

Let  G  be  a  rectangular  p  q  grid  as  shown  in  Fig.  A.l.  Let 
s  of  coordinates  (x,  y)  and  t  of  coordinates  (u,  v)  be  two  arbitrary 
nodes  in  G.  Then  from  Fig.  A.l  we  see  that 

hst  =  |u  -  x|  +  Iv  -  y|  (A. 6) 

Let  Hg(x,  y)  denote  the  distances  (length  of  shortest  path)  from  s  to 
all  other  nodes  in  G, 
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(A.  7) 


H-Cx.  y)  A  £  h 
teS  ^ 

Substituting  Eq.  (A. 6)  in  the  above  equation,  we  find 


y)  =  L  £(|u  -  x|  +  |v  -  y|) 

U=1  V=1 


(A.  8) 


P  ,  ,  q 

H  (x,  y)  =  q  £  |u  -  x|  +  p  (v  -  y) 
u=l  •  v=l 


Hg(x,  y)  =  q[x^  -  (p  +  l)x  +  +  pj^y^  _  (q  +  i)y  + 


(A.9) 


Eq.  (A. 2)  may  be  rewritten  as. 


N(N  -  1)  “  N(N  -  1)  ^  (A.  10) 

seS  ^  x=l  y=l 


Substituting  Eq.  (A.9)  into  the  above  summation,  and  after  some 
algebra,  we  arrive  at 

^  “  j  q^pCp  +i)(p  -  1)  +  jp^qCq  +  i)(q  -  i) 

x=l  y=l 


Hence 


N(N  -  1)  *  pq(pq  -  1) 


(A.ll) 


(A. 12) 


From  Fig.  A.l,  we  can  immediately  see  that  the  diameter  of 


d  =  p  +  q  -  2 


(A. 13) 
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For  a  square  grid  p  =  q  =  \/n^  hence 

I  h  =  |/ir 

*  d  =  2  /ii  -  2 


(A. 14) 

A. 3  Average  Path  Length  of  a  Torus 

A  torus  is  a  grid  where  the  outside  nodes  are  connected 
together  in  a  ring- like  fashion,  as  shown  in  Fig.  A. 2.  The  purpose  of 
the  external  rings  is  to  introduce  topological  synunetry  with  respect 
to  the  nodes.  In  other  words,  all  nodes  are  topologically  equivalent. 
As  a  consequence,  the  distribution  of  path  lengths  from  a  given  node 
to  any  other  node  is  the  same  for  all  nodes.  Hence  the  average  path 
length  simply  becomes, 

h  =  n4t  ^  ‘'st  =  y'> 

teS 

where  s  is  an  arbitrary  node  of  coordinate  (x,  y) .  In  order  to  sim¬ 
plify  the  calculations  we  choose  s  as  shown  in  Fig.  A. 2.  This  figure 
shows  all  such  nodes  s,  for  all  cases  except  for  p  odd  and  q  even, 
which  is  equivalent  to  p  even  and  q  odd.  The  addition  (to  a  grid) 
of  the  external  links  does  not  affect  the  coii5)utation  of  Hg(x,  y)  for 
such  a  choice  of  s;  hence  the  result  in  Eq.  (A. 9)  is  true  for  those 
nodes.  Also  the  diameter  of  the  torus  is  equal  to  the  distance  from  s 
to  the  upper  right  comer  node.  Several  cases  to  consider  are: 


1.  p,  q  even 


1 .  p  even 


h  =  — ^ _ _  JC.. .  ■ 

2(p2  -  1)  2(N  -1) 

d  =  p  =  y/W 


ii.  p  odd 


n/n 

2 

1  =  /i7-  1 


(A.  2 


(A.  2 


Notice  that  for  a  large  N  Eqs.  (A. 20)  and  (A. 21)  are  equivalent. 
Hence  in  the  study  of  large  tori,  we  will  not  differentiate  between 
the  two  cases  above. 


A.  4  Distribution  of  Path  Lengths  in  a  Square  Torus 

Let  hg  be  a  discrete  random  variable  which  represents  the 
distance  (length  of  shortest  path)  from  an  arbitrary  node  s  to  any 
other  node  in  the  network. 

The  probability  distribution  of  h^  is  defined  as. 


P  [h  =  k] 


number  of  nodes  at  a  distance  k  from  s 


N 


(A.2^J 


The  corresponding  z-transform  is  defined  as, 

H^(z)  =  I  z^lhg  =  k]  (A.23) 

k 

Similarly,  let  h  be  the  random  variable  which  represents  distances  in 
the  network.  The  probability  distribution  of  h  is  defined  as 


p^[h  =  k]  = 


number  of  pairs  of  nodes  separated  by  a  distance  equal  to  k 


N(N  -  1) 


(A. 24) 


Also  the  corresponding  z  transform  is 


H(z)  =  I  =  k] 

k  ^ 


(A.25) 


Evaluation  of  H(z)  for  a  square  torus 


If  we  define  Nj^  as  the  niunber  of  nodes  at  distance  k  from  s,  then 


4k 


1  <  k  <  £ 


-  1 


-  2 


(A. 


4(p  -  k) 


p  1  <  k  <  p  -  1 
2  ~  ~ 


Note  that 


P-1  2 

L  N.  =  p  -  1  =  N  -  1 


(A. 


k=l 


From  the  definition  of  the  distribution  of  path  length,  F,q.  (A. 22) 

N, 


Prth  =  kj 


1  <  k  <  p  -  1 


(o 


(A. 


otherwise 


Substituting  Eq.  (A. 29)  into  Eq.  (A. 25),  we  arrive  at 


p-1  N 


H(z)  =  E 

k=l  ^  ^ 


k  _k 


(A. 


and  from  Eq.  (A. 27), 


H(z)  = 


N  -  1 


fP-l)/2  ,  p-1 

E  kz  +  ^  (p  -  k)z 

k=l  k=(p+l)/2 


To  evaluate  the  above  summations  we  need  the  identity  below 
n 


Li,  _k  z  .  n+1  ,  ,  ^  n^ 

kz  =  - j  (1  +  nz  -  (n  +  l)z  ) 

k=l  (1-z) 


Then  after  some  algebra,  we  find 


4.(l  - 


Remarks 


i.  From  the  definition  of  the  z-transform  (Eq.  (A. 23))  H(z) 
must  be  such  that  H(l)  =  1.  This  fact  can  be  easily  checked  using 
I’Hopital’s  rule. 

ii.  H'(l)  represents  the  average  of  the  random  variable  h. 

It  corresponds  to  the  actual  average  path  length  (in  hops)  in  the  net- 
wark.  Hence,  from  Eq.  (A. 21),  H(2)  must  be  such  that 


H'(l)  =  h  =  p/2 


(A. 32) 


Differentiating  Eq.  (A. 31)  with  respect  to  z,  we  find 

H'(z)  =  - - - Y  1^1  +  z  -  (p  -  1)2^^^  +  (p  +  1)  z^  - 

(N  -  1)(1  -  z)-"  L 


-  1 


,(p+3)/2  _  p  -K  1  Jp-l)/2] 


The  application  of  I'Hopital's  rule  to  H* (z)  at  z  =  1,  shows  that 
effectively  Eq.  (A. 32)  holds  true. 

iii.  For  p  even,  Eq.  (A. 31)  is  no  longer  true.  However,  for 
a  large  N  (as  is  the  case  in  Chapter  4)  we  may  confidently  use  Eq.  (A. 31) 


in  both  cases. 


APPENDIX  B 


ANALYSIS  OF  SHARED  STORAGE 
IN  A  COMPUTER  NETWORK  ENVIRONMENT 


Our  earlier  considerations  (Chapter  4)  lead  us  to  model  the 
store-and- forward  (S/F)  function  of  a  node,  as  R  M|m|1  queueing  systems 
which  share  a  finite  waiting  room,  under  some  scheme  (see  Fig.  4,12). 
The  purpose  of  this  appendix  is  to  analyze  and  compare  a  few  existing 
and/or  intuitive  schemes  as  well  as  others  motivated  by  the  limitations 
(or  deficiencies)  of  the  former. 

The  first  (and  simplest)  scheme  is  the  Complete  Partitioning 
(CP)  scheme  where  actually  no  sharing  is  provided,  but  where  the  entire 
storage  (waiting  room)  is  partitioned  among  the  R  servers  (see  Fig. 
B.l.a).  At  the  other  extreme  is  the  second  scheme,  Conplete  Sharing 
(CS) ,  which  is  such  that  an  arriving  customer  is  accepted  if  space  is 
available,  regardless  of  its  class,  i.e.,  independently  of  the  server 
to  which  it  is  directed  (see  Fig.  B.l.b).  CS  succeeds  in  achieving  a 
better  performance  than  CP  (smaller  probability  of  blocking)  under 
normal  traffic  conditions  and  for  fairly  balanced  input  systems.  How¬ 
ever,  for  quite  asymetrical  input  rates^  (A^  i  =  1,  ...,  R) ,  CS  tends 
to  heavily  favor  servers  with  higher  input  rates,  even  though  they  may 
be  close  to  saturation  (input  rate  equal  or  almost  equal  to  service 
rate).  TTie  failure  to  recognize  servers  at  or  near  saturation  results 
in  most  of  the  space  being  occupied  by  customers  waiting  for  those 

^This  remark  assumes  that  all  servers  have  equal  service  rates. 


servers,  at  the  detriment  of  the  others.  Moreover,  even  with  perfectly 
balanced  arrival  rates  (i.e.,  =  X  i  =  1,  ...,  R) ,  under  heavy 

traffic  conditions  (X  »  yC) ,  CS  fails  (where  CP  succeeds)  in  securing 
a  full  utilization  of  all  the  R  servers. 

The  above  considerations  intuitively  indicate  that  contention 
for  space  must  be  limited  in  some  ways.  In  order  to  avoid  the  possible 
utilization  of  the  entire  space  by  any  particular  type(s)  of  customers, 
we  impose  a  limit  on  the  number  of  buffers  to  be  allocated  at  any  time, 
to  any  server.  This  idea  is  incorporated  in  our  third  scheme:  Sharing 
with  Maximum  Queues  (SMXQ) ,  Fig.  B.l.c.  Of  course,  the  sum  of  those 
maxima  must  be  greater  than  the  total  space  if  some  sharing  is  to  be 
provided.  The  SMXQ,  however,  does  not  guarantee  a  full  utilization  of 
the  servers  under  heavy  traffic  conditions.  This  deficiency  motivates 
the  fourth  scheme:  Sharing  with  a  Minimum  Allocation  (SMA)  scheme. 

With  SMA,  a  minimum  number  of  buffers  is  always  reserved  for  each 
server  and,  in  addition,  a  common  pool  of  buffers  is  to  be  shared  ai./ong 
all  the  servers,  with  no  constraints  on  the  queue  size  (see  Fig.  S.l.d). 

The  shared  area  is  again  prone  to  be,  as  mentioned  earlier, 
unfairly  utilized,  and  hence  the  fifth  and  final  scheme:  Sharing  with 
a  Maximum  Queue  and  a  Minimum  Allocation  (SMQMA) (see  Fig.  B.l.e). 

Rich  and  Schwartz  (RICH  75]  studied  a  scheme  very  similar  to 
SMA  except  that  the  entire  common  storage  is  dynamically  allocated  to 

one  server  at  a  time.  Moreover,  problems  of  this  sort  are  frequently 

encountered  in  telephony  and  are  referred  to  as  graded  systems  [KUHN  73] . 

[SYSK  60].  Their  main  interest,  however,  is  in  sharing  (extra)  lines 

as  opposed  to  storage. 
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In  this  appendix,  we  intend  to  characterize  the  five  schemes 
under  steady  state  conditions;  namely,  we  derive  expressions  for  the 
probabilities  of  blocking,  the  joint  and  marginal  distributions,  the 
average  time  in  system  and  the  throughput  for  different  types  of  cus¬ 
tomers.  The  key  to  the  analysis  lies  in  the  fact  that,  in  steady  state, 
the  joint  probability  distribution  obeys  the  well  known  product  form 
solution  for  networks  of  queues  [JACK  57,  63],  [GORD  67],  [BASK  75], 
[BUZE  71],  [WONG  75],  [KLEI  75,  76]  (and  the  bibliographies  therein). 

A  conparison  of  the  sharing  schemes  is  also  provided. 

B.i  Complete  Partitioning  (CP) 

With  CP,  we  are  in  the  presence  of  R  separate  and  independent 
m|m|1  queueing  systems  with  finite  waiting  rooms  (Fig.  B.I. a).  Queueing 
system  i  (i  =  1 ,  . . . ,  R)  is  characterized  by  a  Poisson  input  stream  of 
rate  X^,  an  exponential  service  time  of  mean  l/pC^  and  a  finite  storage 
of  size  b^.  Customers  destined  to  server  i  are  referred  to  as  type  or 
class  i  customers.  Arriving  customers  who  find  no  space  in  their 
respective  queues  depart  without  service.  Accepted  (non  rejected) 
customers  are  served  on  a  First-Come-First-Serve  (FCFS)  basis. 

The  basic  equations  describing  the  behavior  of  any  of  the 
queues  (say  subsystem  i)  are  well  known  (see  for  example,  [KLEI  75]). 

Let  n^^  be  the  number  of  customers  in  queueing  system  i  (i.e.,  type-i 
customers)  and  Pr[n^  =  k]  be  the  probability,  in  steady  state,  of  having 
k  type-i  customers  (in  queue  and  server) ,  then 


-r-^,  p.  0  <  k  <  b. 
b.  +  l  1  —  —  1 


1  -  p. 


P  [n.  =  k]  = 
1 


(B.l) 


otherwise 


Where  p.  =  X./yC. . 
1  11 


Also,  let  PB^  denote  the  probability  of  blocking  for  type-i  customers 
then  [KLEI  75] 


1  -  p.  b. 

''®i  ■ - BTT  “i* 

i-P,i 


Let  n^  be  the  average  number  of  type-i  customers;  then 


(B.2) 


b.  1+b. 

p,  1  -  (1  +  b,)p/  +  b.p,  ^ 


Let  X^  be  the  average  rate  of  non-rejected  type-i  customers,  i.e.,  the 


throughput  of  server  i;  then 


Xr  =  (1  -  PB.)X. 
1  11 


(B.4) 


Also  if  t^  denotes  the  average  time  in  system  (queue  and  server)  of  non 
blocked  type-i  customer,  then  from  Little's  result 


TTius, 


t.  =  n./\: 
1  11 


b.  1+b. 

1/yC  1  -  (1  +  b,)p/  +  b,p,  ^ 

1  -  P. 


(B.5) 


3 


The  state  of  the  entire  system  can  be  described  by  the  vector 
n  =  (n,,  n»,  ....  n  );  since  all  the  subsystems  are  independent,  the 
joint  probability  distribution  in  steady  state  (to  be  denoted  by  P(n) 
or  P{nj,  n^^))  is 


P(nj, 


R 

■ "  ■  ’  ~  n  p 

^  i=l 


n . 
1 


=  0  otherwise 


¥  n  e  F 

a 

(B.7) 


Where  the  set  F  is  the  set  of  feasible  vectors, 
a 

F  =  {n  integer  valued  vector,  s.t.  0  <  n.  <  b. ,  i  =  1,  ...,  R} 

3l  ^  ““  1  1 


(B.8) 

The  subscript  "a"  is  used  here  to  remind  us  of  scheme  (a),  i.e.,  CP. 

Notice  that  the  constant  term  in  Eq.  (B.7)  is,  of  course,  the 
probability  of  the  entire  system  being  empty,  i.e.,  P(0,  ....  0).  That 
term  will  be  denoted  in  the  sequel  as  P^. 

This  terminates  the  analysis  of  CP  in  its  general  form.  A  few 
more  remarks  are  presented  below. 

i.  b^  and  <  1;  we  obtain  the  usual  M|M|1  result 

=  k]  =  (1  -  p.)pj  (B.9) 


11. 


-►  «>  ;  from  Eqs.  (B.l)  and  (B.2),  we  get 


PB. 

1 

1 

_ 

n. 

1 

-► 

b. 

1 

a: 

1 

MCi 

t. 

1 

-4- 

b. 

1 

(B.IO) 
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iii.  =  1;  all  states  =  0,  1,  . . . ,  are  equally  probable 
and 


1 


PB-  =  1  u 

1  1  +  b. 

1 

b. 

1 

^  =  - 


(B.ll) 


We  now  proceed  with  the  analysis  of  CS.  The  definitions  and 
notation  introduced  in  this  section  are  valid  throughout  this  appendix. 


B.2  Complete  Sharing  (CS) 

We  now  combine  all  the  individual  waiting  rooms  in  a  global 
one  (see  Fig.  B.l.b)  whose  size  will  be  denoted  by  B.  Empty  space  is 
allocated  on  a  FCFS  basis  regardless  of  the  type  of  arriving  customer. 
Drukey  [DRUK  75]  analyzed  this  system  with  the  assumption  that  all  the 
Pi's  (Pi  =  Xi/yCi)  are  equal;  for  the  general  case  of  different  Pi's, 
he  restricts  his  study  to  two  classes  of  customer  (R  =  2).  In  what 
follows  we  intend  to  analyze  the  nrore  general  case  of  arbitrary  Pi's 
i  =  1,  ...,  R,  and  then  we  apply  our  results  to  the  special  case  of 
equal  Pi's.  The  forms  of  the  expressions  in  common  with  Drukey 's  differ 
because  of  the  network  of  queues  approach  used  here. 


B.2.1  General  Case 

In  this  section,  the  Pi's  are  not  necessarily  equal.  The 
sharing  of  space  introduces  dependencies  among  the  R  queueing  systems. 
The  entire  system  is  a  birth-death  process  whose  state  can  be  simply 
described  by  the  vector  n  =  (n^,  ...,  Oj^) .  From  the  state-transition- 
rate  diagram  of  Fig.  B.2,  we  can  write  the  set  of  balance  equations 
below,  describing  the  behavior  of  the  system  in  steady  state  (steady 


^ 


r,;- 


r, 

•b 
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only  if  Hr  >  0  TRANSITION  ALLOWED  only  if  2  nj  <  B 


Figure  B.2.  State-Transition-Rate  Diagram. 


Let  J  =  {j  such  that  =  O} ;  two  sets  of  balance  equations  are 

presented  below  which  depend  on  whether  or  not  the  total  number  of 

customers  is  less  than  or  equal  to  B. 

i.  For  all  n's,  such  that  E.n.  <  B,  we  have 
~  ’  11* 


R 

"r>  ‘  .S  “iPt"!- 


n.+l , 
1  ' 


P  (n  ,  •  •  • » 


ii.  For  all  n's,  such  that  Z.n.  =  B, 

-  11 

R  .  V 

A^P(nj,  •••>  ti^-1,  ^  •••>  •••> 

i^J  i^J 

Let  F,  be  the  state  of  feasible  states,  i.e., 

D 

R 

Fj^  =  {n,  s.t.  0  ^  _<  B  ,  0  £  ^ 

i=l 

Then,  we  claim  that  the  steady  state  joint  queue  length  (queue  +  server) 
distribution  is  such  that 

(  Pq^I  ^2  •••Pr  ^  2  ^  ^b 

P(n)  =<  {B.12) 

I  0  otherwise 

where  P^  is  a  constant;  more  precisely,  it  is  the  probability  of  an 
empty  system  P(0,  0,  0). 

To  prove  this  claim  it  is  sufficient  to  show  that  Eq.  (B.12) 
satisfies  the  above  balance  equations;  this  is  easily  shown  to  be  the 
case.  Notice  also  that  the  product  form  solution  satisfies  the  local 
balance  equations.  In  the  present  situation,  the  local  balance  equation 
[CHAN  72],  [KLEI  75]  is  the  one  which  equates  the  rate  of  flow  out  of 
server  i,  due  to  the  departure  of  customer  (type-i) ,  to  the  rate  of 
flow  in,  due  to  the  arrival  of  a  customer,  to  server  i.  This  approach 
will  be  used  to  analyze  the  other  schemes  whose  global  balance  equations 
are  more  difficult  to  write. 

We  are  now  left  with  the  evaluation  of  P^,  for  that  we  take 
advantage  of  the  fact  that  all  probabilities  must  sum  to  one,  hence 


i 


•  i 


I* 

i 


•  ^ 


t 


9 


9 


h- 
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Let  us  define  G(K)  as 


-1  "i  "7  "r 

neF. 

D 


r-  "1  "r 

G(K)  =  L  P,  "-pR 

a<n^<K  ^  ^ 

E.n.=K 
1  1 


(B.13) 


(B.14) 


Then,  obTdously 


Pn^  =  L  G(K) 


(B.15) 


Several  efficient  altorithms  exist  to  compute  G(K),  [BUZE  71],  [WILL  74] 
[MOOR  72].  Moreover,  if  all  the  P^'s  are  different,  we  can  use  the 
generating  function  approach  to  derive  a  closed  form  expression  for 
G(K),  [MOOR  72].  Briefly,  we  let 

R  R 

g(t)  =  n  n-  ^  ti  =  n  (1  +  Pjt  +  pjt^  +  •..)  (B.16) 

i=l  Pi^^  i=l  ^ 

By  expanding  the  second  product  we  recognize  that  G(K)  is  the  coefficient 


of  t  ,  i.e. 


g(t)  =  1  +  G(l)t  +  G(2)t^  +  ...  +  G(K)t’^  + 


(B.17) 


Iso,  if  the  p^’s  are  different^  (p^  p^),  then  the  partial  fraction 


expression  of  the  first  product  in  Eq.  (B.161  gives 


R  j  R 

I  i?i  -  V’  '  i?i 

( *1 '  n  (1  --pr/pT)' 


(B.18) 


k=l  "  Pk^Pi^ 

m 


^This  condition  will  be  assumed  to  be  true  throughout  the  rest  of  this 
section. 
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Hence 


R 


G(K)  =  £  A  K  =  0,  1,  2, 

1  =  1 


Then,  from  Eqs.  (B.13),  (B.i4)  and  (B.19) 
0-1 


0 


B  R  ^  R  B 

=  s  z  =  i;  A  E 

K=0  i=l  i=l  K=0 


and 


0 


R  1 


(B.19) 


(B.20) 


i=l  '  1 

Eqs.  (B.12)  and  (B.20)  completely  characterize  our  system  in 
the  steady  state.  Notice  that  for  R  =  1,  =  1  and  we  find  Eq.  (B.l) 

for  B  =  bj.  We  now  proceed  to  derive  the  distributions  in  steady  state 
or  expressions  of  other  variables  of  interest. 


Distribution  of  the  total  number  in  system;  Probability  of  Blocking 

Let  n  be  that  number  and  =  Pj.lZ!nj^  =  n]  be  the  corresponding 
distribution;  then  obviously 


P  = 


PoG(n)  =  - 


G(n) 


for  0  <  n  <  B 


L  G(k) 
k=0 


(B.21) 


=  0 


otherwise 


Since  we  have  Poisson  arrivals,  then  the  probability  of  blocking,  PB, 
is  simply  the  probability  that  n  =  B.  Notice  also,  that  PB  is  indepen¬ 
dent  of  the  customer's  type  (PB.  =  PB  -V  i  =  1 ,  .  .  .  ,  R) 


Marginal  Distributions  and  Averages 

Let  us  first  derive  the  probability  that  there  are  more  than  j 
type  i  customers  in  the  system,  ^  That  probability  is  equal 
to  the  sum  of  P(n)  for  n  e  and  such  that  ^  j;  after  some  algebra 
we  find  (for  0  £  j  ^  B) , 


The  average  number  of  type  i  customers  is 

=  £  Prt"i  i  Jl  =  Po  £,  '=<"> 

J=1  j=l  n=0 

Interchanging  the  summations  above,  we  arrive  at 

B-1  B-n  .  B-1  p.  - 

Si  =  P„  £  <=(n)  £  p’  -  P„  £  \  G(„) 

n=0  j=l  n=0  1 


Pi 


hence 


-  _  -i  n=0 


'l  (1  -  Pr")G(n) 


X  1  -  p. 


B 

L  G(n) 
n=0 


(B.24) 


As  for  the  expression  of  the  average  total  number  in  system,  it  is 


simply  n  =  Another  expression  can  be  derived  from  Eq.  (B.21), 

B 

n  =  Pf,  L  kG(k) 

^  k=l 


From  Eq.  (B. 19) 


n  =  P 


B 

0  ^ 
"  k=0 


R 

L 


AiP-  =  Pq  I  \  t  kpj 

i=l  i=l  k=0 


B 

L 


R  1  +  Bp}""®  -  (B  +  l)p® 

Thus  S  =  P  X;  A.  - i - = - -  (B.25) 

^  i=l  ^  (1  -  p.)^ 

where  P^  is  given  by  Eq.  (B.20). 


•  1 


323 


Notice  that  if  R  =  1,  then  n  =  n^,  as  found  in  Eq.  (8.3).  Eqs.  (8.4) 
and  (8.5),  for  the  throughput  and  the  average  delay  t^,  hold  true 
here. 

This  terminates  the  characterization  of  CS  in  its  more  general 
case.  Of  interest  is  the  study  of  its  behavior  under  some  special 
limiting  conditions  of  storage  and  traffic. 


8.2.2  Limiting  Behavior 

We  consider  the  two  cases;  first,  when  8  goes  to  infinity,  and 
second,  when  all  arrival  rates  increase  uniformly  toward  infinity. 


Infinite  waiting  room  8  “ 

For  the  existence  of  a  steady  state,  it  is  necessary  that 
<  1  (i  =  1,  ...,  R) .  With  this  condition,  we  expect  that  at  8  =  <» 
the  system  becomes  equivalent  to  R  independent  M|m|i  queues.  From 
Eq.  (8.20)  and  since  <  1 , 


limit  P 


•1 


0 


R  A. 

T  - — 

1  -  P. 

1=1 


B  -*•  00 

which  is  the  value  of  g(t)  (Eqs.  (8.16)  and  (8.18))  at  t  =  1,  hence. 


P 


0 


R 


n 

i=l 


(1  -  P^) 


R  n. 

Thus  from  Eq.  (8.12)  P(n)  =  fj  (1  -  p.)p.^ 

i=l  ^  ^ 


Moreover,  from  Eq.  (8.23)  we  can  see  that  at  the  limit 


The  last  three  equations  prove  the  fact  that  the  system  is  equivalent  to 
R  independent  m|m|i  queues. 

Note  also,  that  from  Eq.  (B.19)  and  fB.22) 

limit  G(B)  =0  =>  limit  PB  =  0 

B  -»•  oo  B  -►  “> 

Uniform  increase  of  input  rates 

We  now  intend  to  let  all  input  rates  increase  proportionally 
until  eventually  reaching  infinity.  At  this  point  we  show  that  our 
system  becomes  equivalent  to  a  closed  network  of  R  queues  and  B  cus¬ 
tomers.  Let 

Xi  =  nA?  i  =  1,  ....  R  (B.26) 

where  ri  is  a  positive  real  variable  and  A?  is  a  constant.  The  service 
rates  (pC)  are  maintained  constant.  Eq.  (B.26)  is  also  equivalent 
to  saying  that  =  np?  with  p?  constant. 

From  the  above  definition  and  Eq.  (B.15), 


G(K)  =  n'^G^CK)  (B.27) 

R  K 

where  G*^(K)  =  51  A.  (p?)  . 

i=l  ^  ^ 

Notice  that  A^  is  invariant;  hence,  G^(K)is  a  constant.  As  a  result, 
and  from  Eq.  (B.15) 


Pn^  =  L  n‘^G°(K)  =>  limit  P»  =  0 

^  K=0  n  00 

Furthermore,  from  Eq, (B. 21) 

(0  n  B 

limit  P  =  ^ 

n->-oo*^  (l  n  =  B 


In  other  words,  as  expected,  the  system  is  full  with  Probability  1 
(probability  of  blocking  is  consequently  equal  to  1). 

With  regard  to  the  joint  queue  length  distribution,  Eq.  (B,12) 


becomes. 


P(n) 


R 

"  A  (pH 


R  ,  ^  ”i 


¥  n  e  F, 


2-  n  G  (k) 
k=0 


Thus 


n. 

1 


limit  P(n)  = 


G°(B) 


0 


for  n  e  F,  and  s.t.  E.n, 
~  b  11 


otherwise 


=  B 


(B.28) 


With  respect  to  the  marginal  distributions  from  Eq.  (B.24), 

1  j]  =  - 

ri  ->■  00 

Furthermore,  the  limiting  throughput  of  type-i  customers  is 


iwt  p  ,„  >  j, .  (p»f 

-  ^  “  r  1  qU  \  If 


,0, 


limit  (1  -  PBjnA?  =  ^  X? 

n  -»■  <»  ^  G  (B)  ^ 


(B.29) 


(B.30) 


The  above  results  are  typical  of  a  closed  network  of  queues  with  R 
service  centers  and  B  customers,  such  as  Buzen’s  central  server  model 
[BUZE  71].  In  fact,  if  we  consider  the  buffers  as  customers  and  if  we 
let  i.e.,  X?/ZX?  ,  be  the  relative  rate  of  arrivals  to  server  i 

(i.e.,  rate  o;f' allocations  of  freed  buffers  to  server  i) ,  and  n^^  be  the 
number  of  buffers  allocated  to  server  i,  then  the  distribution  of  buf¬ 
fers  is  given  by  the  above  expressions  (Eqs.  (B.28)  and  (B.29)). 
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Moreover,  Eq.  (B.29)  shows  that  there  is  a  non- zero  probability 
that  server  i  is  idle  (  =  o]  =  1  -  G*^(B  -  l)p?/G^(B)).  Therefore, 

even  with  infinite  input  rates,  server  i  is  not  fully  utilized,  which 
was  not  the  case  with  CP  (see  Eq.  (B.IO)). 

The  numerical  example  below  illustrates  the  general  and  limit¬ 
ing  behavior  of  this  system  with  respect  to  n.  In  this  example  we 
assume  that  R  =  4,  B  =  20,  =  0.1,  p®  =  0.4,  p^  =  0.6,  p^  =  0.9 

and  we  let  p.  =  np^  • 

1  1 

The  utilization  of  server  i  is  pf  =  (1  -  PB)p^  =  (1  -  PB)np^  • 
The  limiting  value  of  n(l  -  PB)  =  1.111;  hence,  q  ->•  <»  =>  p'  =  0.1111 
p'  =  0.4444,  p^  =  0.6666,  p'  =  0.9999.  Notice  that  server  4  reached 
saturation  whereas  the  others  are  very  far  from  it.  The  average  total 
limiting  utilization  is  p  =  ^  7  pT  =  0.555  instead  of  1  which  could  be 
obtained  with  CP. 

Furthermore,  Fig.  B.3  shows  the  behavior  of  the  average  number 
of  type-i  customers  in  the  system  (i  =  1,  . . . ,  4)  with  respect  to  q. 
Also  represented,  is  the  average  total  number  in  system,  ii.  The  limit¬ 
ing  values  for  the  averages  are  (obtained  at  q  =  20) 

n^  ■*  0.125,  t\2  0.800,  n^  -*■  1.99,  n^  17.02  and  n  -►  19.94  «  B. 

Notice  that  for  large  q,  most  of  the  buffers  are,  on  the  average,  used 
by  type  4  customers.  Also,  a  sharp  increase  of  ii^  (from  4.8  to  14) 
occurs  when  q  varies  from  0.95  to  1.5.  The  value  of  q  *  1.111  would 
correspond  to  the  saturation  of  server  4  (i.e.,  p^  w  1)  if  there  were 
no  limitation  in  buffer  storage,  and  at  that  point  the  queue  size 
becomes  infinite.  This  explains  the  sharp  increase  in  n^,  mentioned 
above. 
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In  summary,  we  conclude  that  with  quite  asymmetrical  utili¬ 
zations  {p?},  CS  tends  to  favor  the  server  with  the  highest  utilization 
even  though  it  has  reached  saturation.  Furthermore,  the  other  servers 
are  left  with  very  little  space  to  share.  These  considerations  motivate 
the  schemes  studied  in  the  rest  of  this  appendix.  Before  we  proceed, 
let  us  apply  the  general  results  obtained  in  this  section  to  the  case 
where  all  Pj^'s  are  equal. 


B.2.3  Special  Case:  Equal  p^'s 

This  section  deals  with  the  case  where  all  the  p^'s  are  equal 
and  we  let  p  be  the  common  value.  As  a  result,  a  simpler  expression  is 
obtained  for  G(K) ,  and  thus  for  the  other  variables  and  distributions. 

G(K)  is  the  well  known  expression  obtained  for  networks  of 
queues  [KLEI  75] , 


Thus,  from  Eq.  (B.15) 


(B.31) 


(B.32) 


And  from  Eq.  (B.22) 


(B.33) 


Also 


l.n. 

P(n)  =  PqP  ^  ^ 


(B.34) 
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With  respect  to  the  average  nvunber  of  customers  of  any  type  (say  i) ,  the 
numerator  of  Eq.  (B.24)  becomes 


B-1  B-1  B-1  /n  +  R  -  1\ 

£  (1  -  p®-"  CC„)  =  £  G(n)  -  p“  E  ( 
n=0  n=0  n=0  \  R  -  1  / 


=  Z  G(n) 

n=0 


/B  +  R  -  Iv 
\  R  / 


hence. 


n.  =  T-c — 

1  1  -  p 


B-1 

L  G(K)  -  ( 

K=0  ' 


B  +  R  -  1\.B 


B 

L  G(K) 
K=0 


i  =  1,  . . . ,  R  (B. 35) 


In  the  above  proof,  we  used  the  identity 


m  /r  +  k\  /r  +  m+l\ 


m  /r  +  k\  / 

r( 

K=0\  k  /  \ 


(B.36) 


Similarly,  the  average  time  in  system  of  the  non-rejected  type-i 


customers  is 


,  '/‘‘i  kTo 

'i  ■  1  -  p 


i  =  1 ,  . . . ,  R 

(B.37) 


Of  interest  are  the  two  cases  when  p  =  1  and  p  » 


P(nj,  . 


.  • ,  Hr)  =  Pq  V-  n  e 


K  -  ^"i 


B  +  R  1 
i  "  R  +  1  yC. 


We  frequently  used  Eq.  (8.36)  in  the  derivations  of  the  above  equations. 
Note  that  all  states  n  are  of  equal  probability  Pq.  Furthermore,  if 
R  =  1  we  find  the  results  obtained  in  Section  B.l  (Eq.  B.ll). 

The  expression  of  PB  may  be  rewritten  as  1/(1  +  B/R) ,  which  is 
exactly  the  same  as  for  a  single  M|m|i  queue  with  B/R  buffers  (Eq.  B.ll). 
This  means  that  for  p  =  1,  CS  and  CP  lead  to  the  same  probability  of 
blocking.  This  fact  will  be  illustrated  in  the  figures  below. 


The  service  rates  (MC^)  are  assumed  to  be  constant.  The  limits 


are 


Pq-O 


n. 

1 


xr 

1 


B/R 


PB  ^  1 
ii  -»■  B 


(B.39) 


B  +  R  -  1 


B  t-  R  -  1 
R 


MCi 


i  =  1 ,  ...»  R 
i  =  1 ,  . . . ,  R 


As  noticed  earlier,  infinite  input  rates  do  not  lead  to  full  utilization 
of  the  servers  (except  for  R  =  1) ,  but  only  to  a  fraction  B/(B  +  R  -  1) 
of  the  capacity. 

The  behavior  of  the  probability  of  blocking,  PB,  the  throughput 
normalized  with  respect  to  the  capacity,  Xr/yC^,  and  the  delay  normal¬ 
ized  with  respect  to  the  average  service  time,  yC^t^,  are  illustrated 
respectively  in  Figs.  B.4  -  B.6. 

This  concludes  the  analysis  of  the  complete  sharing  (CS)  scheme. 
Let  us  now  compare  it  with  the  complete  partitioning  (CP)  scheme. 
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B.2.4. 


Comparison  of  CT*  and  CS 


We  assume  that  all  p^'s  are  equal  (to  p)  and  that  each  server 
"contributed"  buffers,  i.e.,  =  B^  i  =  1,  R  (see  Fig.  B.l.a), 

therefore,  B  =  RB^. 

With  the  above  conditions  the  behavior  of  CP  (any  queue)  is 
identical  to  CS  with  R  =  1. 

Fig.  B.7  illustrates  the  behavior  of  the  probability  of  block¬ 
ing,  PB,  with  respect  to  p  and  for  a  set  of  values  of  R,  (R  =  1,  . . . ,  4) . 
R  =  1  corresponds  to  CP;  R  =  2,  3,  4  corresponds  to  the  merging  of 
2,  3,  4  single  queues.  Note  that  all  the  curves  meet  at  p  =  1  where, 
from  Eq.  (B.38),  PB  =  1/(1  +  B^) .  Note  also  that  for  0  ^  p  ^  1,  CS  leads 
to  a  smaller  PB,  hence  a  better  performance  than  CP.  This  improvement 
is  quite  considerable  for  small  values  of  B^  and  increases  with  R. 
However,  for  p  ^  1,  CP  shows  a  slightly  better  performance  (smaller  PB) 
than  CS,  namely  for  small  values  of  B^. 

Fig.  B.8  shows  the  respective  channel  utilizations  p(l  -  PB) 
(normalized  throughputs  A'/pC).  Note  the  loss  in  limiting  throughput 
(p  -»■  oo)  with  CS  for  small  values  of  B^. 

Finally,  Fig.  B.9  shows  the  respective  average  delays.  We  note 
that  the  average  message  delay  increases  as  more  buffers  are  provided, 
i.e.,  as  R  increases. 

The  slightly  better  performance  of  CP  for  p  >  1  intuitively  in¬ 
dicates  that  some  buffers  should  be  permanently  allocated  to  each  server. 
This  idea  is  incorporated  in  Scheme  4,  SMA.  Moreover,  we  observed 
earlier  that  very  unbalanced  input  rates  lead  to  uneven  (on  the  average) 
usage  of  the  storage  space.  This  remark  motivates  the  next  scheme,  SMXQ. 
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B.3  Sharing  with  Maximum  Queue  Lengths  (SMXQ) 

Like  CS,  SMXQ  allows  the  sharing  of  a  pool  of  B  buffers  with  a 
further  constraint  imposed  on  the  number  of  buffers  to  be  allocated  to 
any  server,  and  at  any  time.  Let  b^  be  the  maximum  number  of  buffers 
that  can  be  used  by  type-i  customers;  the  set  of  feasible  states  becomes 

i  =  1 »  •  <  •  j  R I 


0  < 


L 

i=l 


n.  < 
1  — 


0  < 


n.  < 
1  — 


b. 

1 


Taking  advantage  of  a  previous  remark,  we  directly  write  the  set  of 
local  balance  equations  which  describe  the  behavior  of  the  system  of 
queues  in  steady  state.  We  consider  below  the  possible  transitions  for 
type-i  customers. 


1. 
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Again,  it  is  easy  to  check  that  the  joint  probability  distribution 
satisfies  Eqs.  (B.12)  and  (B.13)  where  F|^  is  replaced  by  F^. 

The  evaluation  of  is  much  more  complicated  here,  because 
of  the  added  constraint  on  n^.  In  what  follows,  we  again  consider  the 
two  cases  of  different  and  equal  P^'s. 


B . 3 . 1  General  Case 

In  this  section  the  p^'s  are  not  necessarily  equal.  We  first 
evaluate  P^.  From  the  above  considerations 


=  E  Q(K) 

(B.40) 

K=0 

^  *^1  "r 

E  Pi  •••  Pr 

(B.41) 

r 

where  Q(Kj  = 

R 

E  n.=K 
i=l  1 

0<n.<b. 

—  1—1 

Note  that  the  difference  between  Q(K)  and  G(K) ,  Eq.  (B.14),  comes  from 
the  added  constraint,  _<  b.. 

In  order  to  find  Q(K),  we  use  a  method  similar  to  the  generat¬ 
ing  function  approach.  Let  f(t)  be  defined  as 


f(t)  =  ri  (  1  +  P^t  +  pjt^  +  . . .  +  p. H  ^  ) 
i=l 


b.  b. 
1^  1 


(B.42) 

By  expanding  the  above  product,  we  recognize  that  Q(K)  is  the  factor  of 


t  ,  i.e. , 


f(t)  =  Q(0)  +  Q(l)t  +  ...  +  Q(K)t  +  ...  +  Q 


,  R  ,  Ib. 


Note  that  Q(0)  =  1  and  that  the  highest  degree  is  equal  to  ^  b^. 

i  =  l 


r 

r  J 


t-- 


ri 


I- 
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The  function  f(t)  may  be  rewritten  as 


1+b.  1+b. 
R  1  -  p.  ^t  ^ 

f(t)  =  n  ^ 


i=l 


1  -  p.t 


(B.43] 


Let  h(t)  = 


R 

n 


1 


.  ,  1+b.  1+b. 

^=1  1  -  p.  ^t  " 
1 


=  n  ( 1  -  Pi  "t  " 

i=l  ' 


1+b.  1+b. 


k(l+b  )  k(l+b  ) 
-  Pi  t 


(B.44' 


Then  from  Eqs.  (B.16),  (B.42)  -  (B.44),  we  have 

g(t)  =  f(t)h(t) 


(B.45: 


The  above  relation  will  allow  us  to  express  G  in  terms  of  Q  and  C^. 
The  term  C  results  from  the  partial  fraction  expansion  of  h(t) . 
Assuming  that  all  p^^'s  are  different,  we  find 


R 


h(t)  =  E 
i=l 


C. 

1 


1+b.  r*¥7 

1  -  P,  s  ^ 


1+b. 


-  ,  -  k(b.+l) 

L  Ci(l  +  (Pit)  ^  +  ...  +  (p.t)  ^  ...)  (B 


.46 


R 


1 

II  — 

1+b 

-  Cp/pp 

Finally, 

lb. 

If) 

L  r  R  / 

E  G(K)t‘^  = 

Q(K)  t*^ 

1  ^  '"i  ‘  * 

K=0 

\k=o 

) 

^Li=i 

(B.47] 


k(l+b  ) 

+  ...  +  (Pj^t)  +  •••) 


(B.48] 


Then  equating  the  terms  uf  equal  degrees  in  t,  we  arrive  at  a  relation 
between  G,  Q  and  C^'s.  TTiis  relation  is  quite  complicated  and  requires 
the  ordering  of  the  b^'s  and  their  multiples.  However,  if  we  restrict 

g 

our  considerations  either  to  the  case  where  b.  >  -:r  ,  or  when  b.  =  b 
for  all  i,  then  we  obtain  the  simple  relations  below. 

i.  We  assume  that  each  queue  is  allowed  to  occupy  more  than 
half  of  the  entire  space,  i.e., 

b.  2  B/2  V  i  =  1,  ...,  R  (B.49) 

then  2b.  +  2  >  B  and  b.  +  b.  +  2  >  B.  Therefore,  in  order  to  find 

1  1  j 

Q(K) ,  K  =  0,  1,  . . . ,  B,  we  only  need  to  consider  the  terms  in  hft), 

of  degree  0  or  1  +  b^.  Those  terms,  obtained  directly  from  Eq.  (B.44) 

(i.e.,  without  the  assumption  of  different  p^'s),  are 

R  1+b. 

1  +  E  (Pjt)  " 
i=l 

Substituting  the  above  expression  into  Eq.  (B.48),  we  arrive  at 

R  1+b. 

G(K)  =  Q(K)  +  i;  p.  ^Q(K  -  b  -  1)  (B.SO) 

i=l 

i  s.t.  b.  <  K 
1 

ii.  b.=b  i=l,  ...,R 
-1 _ 

Similar  to  G(K) ,  let  us  define 

(l+b)n  (l+b)n„ 

L(K)  =  Y.  P,  •••pR  CB.51) 

Zn.=K  ^  ^ 

1 

Then,  from  Eqs.  (B.45)  -  (B.48), 
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(B.52) 


L(K)  =  L 
i=l 

and  h(t)  =  E  (B.53) 

K>0 

Substituting  the  above  into  Eq.  (B.48),  we  arrive  at 

a 

G(a(b  +  l)  +  k)  =  Q((“  -  i)(b  +  1)  +  k)L(i)  (B.54) 

i=0 

where  0  ^  k  ^  b 
and  0  <  a(b  +  1)  +  k  <  B 


Note  that  G(0)  =  Q(0)  =  L(0)  =  1,  and  that  Eq.  CB.54)  allows  the 
sequential  computation  of  the  sequence  Q(K) ,  for  K  varying  from  1  to  B. 
Note  that  if  b  ^  B/2,  then  Eq.  (B.54)  becomes 


G(k)  =  Q(k) 

G(b  +  1  +  k)  =  Q(b  +  1  +  k)  +  Q(k)L(l) 

^  Xl*h) 


for  0  £  k  £  b 
0  <  k  <  B  -  b  -  1 


and  from  Eq.  (B.51),  L(l)  =  2  P|  ^  ,  thus  the  combination  of  the 

i=l 

last  three  equations  gives  Eq.  (B.50)  with  b^  =  b. 

In  what  follows,  we  restrict  our  considerations  to  the  case 
where  b^^  ^  B/2,  unless  specified  otherwise.  As  a  result,  and  from 
Eqs.  (B.40)  and  (B.50),  we  arrive  at 


Pq^  =  E  G(K)  -  E  Pi  "  E  G(K) 

K=0  i=l  ^  K=0 


B  R  1-b. 


(B.55) 


Note  that  if  b^^  =  B  for  all  i,  then  SMXQ  becomes  CS,  2ind  the  above 

equation  reduces^  to  Eq.  (B.15), 

1  a-1  A 


By  convention  we  set  21  =  0 

a 


¥  a  integer 
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^ 


Similar  to  Section  B.2,  we  now  assume  that  all  the  p^'s  are 
different  (p^  ^  p^  -V  i  ^  j);  then  using  Eqs.  (B.19)  and  (B.20),  we 
arrive  at 


R 

T,  A. 

i= 1  ^ 


R 

E 

i=l 

0<b^<B 


1+b. 

1 

Pi 


R 

E 

j=i 


1  - 


A. 

] 


B-b. 


(B.56) 


Eq.  (B.12)  with  replaced  by  F^,  and  Eq.  (B.55)  or  (B.56)  completely 
characterize  the  queueing  system  under  SMXQ  and  the  condition  of  Eq. 
(B.49).  Similarly  to  Section  B.2,  we  proceed  with  the  derivation  of 
distributions  and  average  quantities  of  interest. 


Distribution  of  the  total  number  in  system:  Probability  of  Blocking 

Let  n  be  the  total  number  in  system;  then,  for  0  £  n  ^  B, 

p„  ^  =  nl  =  P„Q(n). 

r"  R  1+b  •  -» 

^n  “  Pi 

b.<n 

1 

Let  us  now  derive  the  probability  of  blocking  of  type-i  customers,  PB^. 
Recall  that  type-i  customers  are  blocked  if  upon  arrival  the  entire 
space  is  full,  or  if  the  number  of  type-i  customers  is  equal  to  b^. 
Since  arrivals  are  Poisson,  then 


PB.  =  P  [E.n.  =  B  or  n.  =  b.] 

1  r  jj  1  1 


(B.58) 


which  is  also 


PB.  =  P  [S.n.  =  B]  +  P  [n.  =  b.  and  L.n.  <  B] 

1  r'-33  r*^!  i  33  J 
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Moreover, 
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The  summation  above  is  similar  to  that  of  Pq  except  that  B  is  now  B  -  k 

B  -  k 

and  the  con5)onent  n.  is  deleted.  Also  note  that  b.  >  — = —  it  j  i, 

1  j  ^ 

hence  the  above  summa'ion  is  given  by  Eq.  (B.55)  where  B  is  replaced  by 
B  -  k  and  G(K)  by  G. (K)  (as  defined  in  Eq.  (B.59)).  Thus, 

^  r  B-k  R  1+b. 

P  [n  =  k]  =  P  pI  Z  G  (K)  -  Ep.  ^  E  G  (K)  (B.61) 

^  ^  ^  ^  [k=0  ^  3  =  1  J  K=0  ^ 


The  above  summations  can  be  further  reduced  to  expressions  similar  to 
the  one  in  Eq.  (B.56). 

As  for  the  average  number  of  type-i  customers. 


Hi  =  E 

k=l  ^  ^ 


(B.62) 


With  respect  to  the  throughput  Xf  and  the  delay  t^,  the  expressions 
Xr=  (1  -  PB^)X^  and  t^  =  ^i'^^i  true. 

This  terminates  the  characterization  of  the  system  as  operated 
with  SMXQ  and  with  the  assumption  of  b^  ^  B/2  and  different  p^'s.  Next 
we  study  the  case  of  equal  p^'s;  we  leave  numerical  applications  to 
Section  B.6. 


B.3.2  Special  Case;  Equal  p^^’s 

Similar  to  Section  B.2.3,  let  p^^  =  p  -V-  i;  then,  G(K)  is 
given  by  Eq.  (B.31j.  Also,  we  assume  that  all  the  b^'s  are  equal  to  b 
and  that  b  ^  B/2.  Therefore,  from  Eq.  (B.55)  P^^  becomes 


^0^  ■  k c'l-i  y  -  R - 1  'h 


(B.63) 
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As  for  the  distribution  of  the  total  number  of  customers,  from 
Eq.  (B.57)  we  obtain  (for  n  ^  B) 

P 

P 

In  order  to  find  the  probability  of  overflow  PB  (the  same  for  any  type 

of  customers),  we  need  to  evaluate  G  (K)  (Eq.  (B.59));  similarly  to 

1 

G(K) ,  we  have 

G.(K)  =  (‘^  R  !  2 


0  <  n  <  b 


o("  ;  -  ;  y 


(B.64) 


Substituting  Eq.  (B.64)  for  n  =  B,  and  Eq.  (B.65)  into  Eq.  (B.60), 


we  arrive  at 


With  regard  to  the  marginal  distribution  of  any  type  of  customer,  say 
i,  from  Eqs.  (B.61)  and  (B.65),  we  find 


P 

r 


[n^=k] 


for  k  _<  b 
=  0  otherwise 


(R-l)P 


b+1 


B-b-k-1 

L 

K=0 


(B.67) 


The  rest  of  the  expressions,  n^,  X^,  t^,  follow  in  the  same  way  as 
before.  Of  interest  is  the  case  where  p  =  1  and  p  -»■  “ 

P  =  1: 

Using  the  identity  in  Eq.  (B.36),  we  arrive  at 


347 


i'>o‘ -(“;“) 

\  '  (B.68) 

Note  that  if  R  =  1,  then  PB  =  Pq  =  l/(h  +  1),  which  is  exactly  the  prob¬ 
ability  of  blocking  (and  P^)  for  a  single  m|m|i  queue  with  b  buffers; 
because  of  the  constraint,  n^  ^  b,  the  rest  of  the  space  B  -  b  is 
unutilized. 


As  for  R  =  2,  then 
PB  =  vv-  ... 


b  ♦  1 

(B  +  2)(B  +  l)/2  -  (B  -  b  +  1)(B  -  b) 


(B.69) 


Note  that  for  the  two  limiting  cases  where  b  =  B,  i.e.,  CS  and  b  =  B/2, 
i.e..  O’,  we  have  PB  =  2/(B  +  2);  which  checks  with  the  previous  results 
(Sections  B.l  and  B.2).  Moreover,  a  question  arises  as  to  choice  of  b. 
This  question  and  numerical  examples  are  presented  in  Section  B.6. 

Let  us  now  consider  the  case  when  p  goes  to  infinity. 

P  ->  00 

Of  interest  is  the  limiting  utilization.  Let  p'  be  the 
utilization  of  any  server;  then  p''  =  (1  -  PB)p.  Before  we  proceed, 
note  that  Pq  ^  0,  PB  1.  As  for  p' 

/B  +  R-  2\  p/B-b+R-3\  /B-b+R-3\ 

p^  =  (1  -  PB)p  -  *  ^  ^  ^  ^  ^ - Ll! - L:lJ - ' 

(B.R-l).,(B-b.R-2) 

(B.70) 

Note  that  if  R  =  1  then  p'  =  1  (provided  that  B  1) ;  whereas,  for 
R  =  2  (B  ^  1),  two  cases  are  possible: 
i.  B/2  _<  b  <  B  =>  p'  =  1 
ii.  b  =  B  =>  p'  =  B/(B  +  1) 


Both  results  are  to  be  expected,  b  <  B  implies  that  no  one  of  the  two 
types  can  utilize  the  whole  space,  and  hence,  at  p  =  “  the  two  types 
are  always  present,  b  =  B  implies  CS;  hence  we  are  back  to  the  results 
of  Eq.  (B.39)  for  Aj/yC^  and  R  =  2. 

B.4  Sharing  with  Minimum  Allocations  (SMA) 

Similar  to  CS,  SMA  allows  the  sharing  of  a  pool  of  B  buffers, 
and  in  addition,  a^  buffers  are  permanently  allocated  to  type-i  cus¬ 
tomers,  i  =  1,  ...,  R  (see  Fig,  B.l).  As  a  result,  the  set  of  feasible 
states  becomes 

i  R  ) 

{  D  I  E  fo,  n^  -  a^}  _<  B  ,  0  £  n^  <  B  +  a^  i  =  1,  . . . ,  r| 

Here  also,  we  can  write  the  local  balance  equations  and  verify  that  the 
product  form  solution  satisfies  those  equations.  Following  the  same 
steps  as  earlier,  we  first  consider  the  general  case  of  different  p^'s. 

B,4.1  General  Case 

In  order  to  evaluate  P^,  we  partition  the  set  F^  into  disjoint 
subsets  which  lead  to  known  summations.  Let  d?  be  the  set  of  customer 
types, 

,^=  {l,  2,  ...,  r} 
and  let  .it'  be  the  set  of  all  subsets  of  df. 


TTie  set  3t'  contains  2*  elements;  among  them  are  the  set  .>?  itself  and 
the  empty  set.  We  then  associate  with  each  subset,  X^,  a  subset  of  F^, 


S  defined  as 
m 


=  U 


>  a.  i  e  X 

—  1  m 

n. 

<  a.  otherwise 

1 


Equivalently, 


52  (n.  -  a.)  <  B,  a.  <  n.  <  B  +  a, 
.  „  1  1—  1—1—  1 


for  i  e  X 


S  =  <  n  m  ,  ( 

I  n.  <  a.  for  i  i  X  ) 

1  1  1  ^  m  ' 


Obviously, 


Therefore, 


SHS  =  <t)  m?^n  and  F ,  =  (j  S 
m  n  d  m 

m=l 


1  /  R  a..  2  /  i\  11 .  > 

‘  =  E  n  =  E  E  n  Pi' 

neFj  'i=l  '  m=l  neS^  'i=l  ' 


R  n. , 


(B.71) 


Let  H  be  the  summation  over  all  states  in  S_,  then 
m  m’ 


A  /  R  a.  »  /  a. ^ 

L  n  Pi^  =  n  E  PiM 

neS  U=1  ’  iiX  '  Osa.<a,  / 


m  11 


z  n  p^ 

0<  E  (n.-a.)<B  ieX 
.  V  1  1  in 

leX 

m 

a. <n. <B+a. 

11  1 


Hence, 


1  -  p.  ,  a.  n. 

»m=  n  r-r^  n  PfM  z  n  Pi^ 

iiX  ^  ^i  UeX  /  0<  E  n.<B  ieX 


0<n.<B 

1 


Let  us  define  the  generating  function  C^,  such  that 


I 


C  (K)  = 
m 


E  (n  P^\ 

ni=K  ) 


(B.72) 


I 

ieX  ^ 

m 

0<n. sK 
1 


C^(K)  is  similar  to  G(K)  given  in  Eq.  (B.14),  thus  it  can  be  computed 
in  the  same  way.  Finally,  let  a  =  (a^,  ...»  aj^) .  Then 


Po‘t5-  '  '’o‘  -  2  ^ 

m=l 

a. 

.  1  -  p/  a.  B 

H^[a.  B]  ^  H  =  n  _  n  Pi"  E  C  (K) 

itx  ^  ^i  ieX  K=0 

^  m  m 


(B.73) 


Note  that  if  a^  =  0  for  all  i,  then  =  G(K) ,  and  the 

above  equation  reduces  to  the  description  of  CS.  Also,  the  summation  of 

C  (K)  is  set  to  1  if  X  =4). 
m  m 

If  we  now  assume  that  all  Pi's  are  different,  then  from 
Eqs.  (B.18)  and  (B.19), 


C  (K)  =  E  A. 

m  ^  T  *¥i  “  1 


ieX 


m 


(B.74) 


where 


A.  =  n  T- 

kex  ^ 


m 


k^i 


Using  a  similar  summation  as  in  Eq.  (B.20),  we  arrive  at 


(B.75) 


«n.  =  n 
Ux 


1  -  P 


a. 

1 


B+1 

1  -  P. 


Pi  i?x  1  -  P. 


m 


m 


ieX 


m 


(B.76) 
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Eq.  (B.12)  with  replaced  by  F^,  and  Eq.  (B.73)  or  (B.76)  completely 
characterize  our  system.  We  now  proceed  with  the  derivation  of  dis¬ 
tributions  and  averages  of  the  variables  of  interest. 


Distribution  of  total  number  of  customers  in  shared  area;  Probability 
of  Blocking 

Let  n  be  the  total  number  of  customers  in  the  shared  area, 
s 

i.e. , 

A  ^ 

n.  =  S15)  {0,  n.  -  a.}  (B.77] 

^  i=l  ’• 


Then  the  distribution  of  n^  is,  for  k  B,  equal  to  the  sum  of  proba¬ 


bilities  of  states  n  which  satisfy  Eq.  (B.77)  for  n^  =  k.  In  order 

the  evaluate  that  summation,  we  use  the  same  methodology  as  for  the 

determination  of  P-.  Let  F,(k)  C  F,  and  S  (k)  C  S  be  defined  as 
0  d^  d  m^  m 


“  {2  ^  {O.  -  a^}  =  k. 


0  <  n.  <  B  a 
—  1  — 


4 


1  = 


a.  <  n.  <  a.  k  i  e  X 
1—1—1  m 

n.  <  a,  i  i  X 

11  m 


E  ("i  -  a.)  =  k 
i£X 

m 


It  is  obvious  that  F^Ck)  =  ^  S^(k) ,  and  consequently 


P,[n3=M=Po  i  Z  (np^) 

m=l  neS  (k)  '  i=l  ' 


(B.78 


m=l  neS  (k) 
~  m 


Let  h  (k)  be  the  summation  over  all  states  in  S  (k) ,  then  similar  to 
m  m 


k]  =  P 


0 


2 

L 


R 


the  above 


and  that 


a. 

A  ^  ^ 

Note  that  if  X  =  4>.  then  h  (k)  =  H  - - 

">  “  i=l  1  -  p. 


["s '  01  =  ''o  f?-''"/  =  '■o  .n  -T-n 


1+a. 

R  1  -  p.  ^ 


(B.SO") 


Osn. <a. 
1  1 


With  respect  to  the  probability  of  blocking  type-r  customers,  PB  ,  it 


PB  =  P  [n  =  B  and  n  >  a  ] 
r  r  s  r  —  r 


(B.81) 


PB^  can  be  computed  similarly  to  P^[^5  =  B]  with  a  restriction  on  the 

choice  of  the  subsets  X  ,  which  must  contain  r,  i.e., 

m 


PB  =  Pf,  y  h  (B) 
m 

m  I  reX 

•  tn 


(B.82) 


n  1 

There  are  2  such  sets  which  can  be  obtained  as  follows.  Let 

=  jif-  {r}  and  =  {x^  n  =  1,  ....  be  the  set  of  subsets 

of  ,  then  ^ri  “  such  a  subset  of  .:>?  which  contains  r. 

Marginal  distribution  and  average  number  and  time  in  system 

We  now  proceed  with  the  derivation  of  the  marginal  distribution 

of  the  number  of  type-i  customers;  more  precisely,  we  intend  to  find 

P^[n^  ^  j]'  consider  the  two  cases  which  depend  on  whether  j  is 

less  than  a  or  not. 
r 

i.  0  <  j  <  a 

_  •'  Tf* 


n^  n  -j  On 
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Let 


m 


be  the  new  minimum  allocations,  and  F,  be  such  that 

d 

=  {  5  I  23  sup  {o,  n^  -  ar}  ^  B,  0  £  n_<  B  +  a^  i 
'  ''  i=l  1  ^ 


Therefore 


n  n  n_ 
1 _ r _ R 


>  j]  =  V'  fl  — P/— Pr' 


The  summation  above  is  similar  to  the  one  of  ,  Eq.  (B.71),  wher 
is  replaced  by  F^.  Let  us  define  the  vector  a'  =  (a^,  a^,  •••, 
then,  replacing  a  by  a'  in  Eq.  (B.73),  we  find 

■  2" 

Prt"r  i  ■  Vr  ^ 

nFl 


¥•  i  <  a 
•’  ■* 


ii.  a  <  j  <  B 

j.  _  j  _ 


„  n,  n  -1+a  n„ 

"  '’o^r  ^  ^  ^l^**’^r^ 


To  evaluate  the  above  summation,  let  us  define  the  set 


'  5 


23  sup  {O,  n^  -  a^}  5  B  -  j  +  a^j 

0<n.  <B-i+a  +a.  I 
—  1  —  •' 


Then,  repJacing  n  -  j  +  a  by  n  ,  in  the  above  equation,  we  arrive 


P  [n  >  j]  =  P^p 
r*-  r  —  ^  0  ] 


J-a, 


23 

neF'/  1 
~  d 


n 

) 

r 


N 

'R 


n  >a 
r  r 


The  set  F^'  is  similar  to  F,  except  for  the  condition  n  >  a  ,  and  the 


r  —  r 


change  from  B  to  B  -  j  +  a^;  then  similar  to  the  derivation  of  PB^, 


3  -a 

Pyt'^r  -  =  ^O^r  ^  23  \(a,  B  -  j  +  a^  for  <  j  <  B 

m 

m  I  reX  (B.85) 

m 

Eqs.  (B.84)  and  (B.85)  give  the  marginal  distribution  of  the  number  of 

type-r  customers.  As  a  result,  we  can  derive  n  ,  and  t  . 

r  r  r 

Let  us  now  apply  the  above  results  for  the  special  case  of 
uniform  utilization  and  allocation. 


B.4.2  Special  case;  =  p  ,  a^  =  a 

The  assumption  of  equal  p^^'s  and  a^*s,  leads  to  much  simpler 

expressions  of  the  variables  above.  First,  if  p  is  the  size  of  the 

subset  X  (p  =  lx  1),  then  from  Eq.  (B.76) 
m  m  ^ 


Note  that  if  p  =  0  then 


C  (K)  = 
m 


0  K  >  0 


1  K  =  0 


(B.86) 


(B.87) 


Also,  from  Eq.  (B.77) 


(B.88) 
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Note  that  C^(K)  and  H^(k)  depend  only  on  the  size  p  of  the  set  X^.  The 
number  of  sets  X  of  size  p  is  equal  to  (p)  ,  thus  from  Eqs.  (B.73)  and 


(B.88), 


(B.89) 


Similarly,  we  derive  the  expression  for  PB^  (Eq.  (B.82).  Recall  that 
we  only  account  for  sets  X^  which  contain  r,  hence  p  ^  1 . 

=  (B.90, 

r  =  1 ,  .  . . ,  R  . 

Note  that  in  the  above  expressions,  a  was  assumed  to  be  greater  than 
zero;  if  a  =  0,  then  all  subsets  X^  are  empty  except  one:  , 

whose  size  is  equal  to  R.  Moreover,  with  a  =  0,  SMA  reduces  to  CS 
and  Eqs.  (B.89)  and  (B.90)  for  a  =  0  check  with  Eqs.  (B.32)  and  (B.33). 
Now,  if  B  =  0  then  SMA  reduces  to  CP,  and  in  fact,  Eqs.  (B.89)  and 
(B.90)  become 

p-1  -  PB  -  1  -  P  0^ 

^0  -  \  1  -  P  /  ’  ^  "  1  - 

Let  us  now  derive  the  marginal  distribution  of  the  number  of  type-r 

customers.  Eq.  (B.84)  provides  Pj.[nj  ^  j]  for  j  <  a^;  the  terms 

H^[a',  B]  can  be  evaluated  as  in  (B.89)  except  that  a^  =  a^  -  j . 

Therefore,  we  must  distinguish  the  sets  X  which  contain  r  from  those 

m 

which  do  not.  If  p  =  | X^|  ,  then  (p  '  |  )  such  sets  contain  r  and 
"  ^)do  not.  As  a  consequence 


p 

r 


15] 


1 

1 


)( 


y 


for  j  <  a 

r 

(B.91) 


For  the  case  where  j  ^  a^,  then  from  Eq.  (B.85)  and  using  the  same 
procedure  as  above,  we  find 


(B.92) 
jratiot 

Of  further  interest  is  the  limiting  behavior  when  p  goes  to 


With  respect  to  n^,  A^,  t^  ,  they  follow  from  the  above  considerations. 


infinity. 


p  •»  °° 

We  expect,  with  the  minimum  allocation  of  a  buffers  per 
server,  to  obtain  a  full  utilization  of  all  servers.  The  terms  of 


highest  degree  in  PB^  and  p(l  - 

PB  )  are 
r 

/  B  +  R  -  1  \  „B+Ra 

(  R-  1 

for  Pq^  and  PB 

r/B+R-2\  /B>R-2\'] 

[(  R-i  )*(  R-2  )\ 

for  p(l  -  PBp 

Therefore,  p  -»■  <»  “>  P.  0,  PB 
'  O  r 

1,  pfl  -  PBp  1. 

This  concludes  the  analysis  of  CSMA,  the  numerical  applications 
and  questions  related  to  the  choice  of  a  are  treated  in  Section  B.6. 
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B. 5  Sharing  with  Maximtun  Queue  Length  and  Minimum 

Allocations :  SMQMA 

In  addition  to  SMA,  SMQMA  (or  scheme  e)  imposes  a  constraint 
on  the  number  of  buffers  from  the  shared  pool  to  be  allocated  to  any 
server,  and  at  any  time  (see  Fig.  B.l).  Let  b^  be  that  constraint  with 
respect  to  server  i.  As  a  result,  the  set  of  feasible  states  becomes 


^'e  "  ^  ^d  I  0  1  sup  {O,  n^  -  a^}  <  b^  i  =  1,  . . . ,  r} 


Equivalently, 


,  R  V 

^  {2  ^  0  _<  23  sup  |0,  n^^  '  1  B,  0  1  +  b^| 


Here  also,  we  can  check  that  the  product  form  solution  satisfies  the 
local  balance  equations.  Thus  we  may  proceed  as  earlier  with  the 
evaluation  of  Pq  and  subsequently  determine  the  distributions  and 
quantities  of  interest.  First  we  consider  the  general  case  of  differ¬ 
ent  Pj^'s. 


General  case 

The  same  procedure  as  in  Section  B.4  can  be  utilized  to 
partition  the  set  F^  into  disjoint  subsets  which  then  lead  to  known 
summations.  Those  subsets  are 


[ 

H  ("i 

-  a.)  <  B,  a.  <  n.  <  b.  +  a. 

1—  1—1—1  1 

i  e  X 

m 

s®  = 

m 

h 

ieX 

m 

n.  <  a. 

1  1 

i  4  X 
^  m 

Consequently 

* 

.  2" 

^0*  =  s 

m=l 

(B.93) 
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where  Q(K)  is  as  defined  in  Eq.  (B.41) 


As  a  consequence,  the  con5)utation  of  follows  as  in 
Sections  B.3  (for  Q(K))  and  B.4.  This  remark  holds  true  for  the  compu¬ 
tation  of  the  summations  which  may  appear  in  the  analysis  of  this 
scheme.  As  a  result,  we  need  not  carry  the  study  of  this  scheme  any 
further. 


B.6  Further  Numerical  Results  and  Comparisons 

In  this  section  we  intend  to  compare  our  first  four  sharing 
schemes:  CP,  CS,  SMXQ,  SMA,  under  the  assumption  of  equal  p^^'s.  Before 
we  proceed,  let  us  recall  that  if  B  is  the  total  number  of  buffers 
(B  =  RBq)  and  b  is  the  maximum  queue  size  (for  any  queue)  when  using 
an  SMXQ  scheme,  then 

1.  if  b  =  B  SMXQ  is  equivalent  to  CS 

2.  if  b  =  Bq  SMXQ  is  equivalent  to  CP 

3.  if  R  =  2  then  SMXQ  is  equivalent  to  SMA  with  a  minimum 
allocation  per  queue  equal  to  B  -  b. 

As  a  result  of  the  above,  the  study  of  SMXQ  with  R  =  2  and  a  variable 
b  will  allow  us  to  cover  the  four  sharing  schemes  to  be  considered  here. 

In  the  numerical  example  below  we  assume  that  R  =  2,  B  =  6 
and  that  b  satisfies  B/2  b  ^  B  (see  Eq.(B.49),  i.e.,  b  =  3, 4, 5, 6. 

From  our  previous  considerations  we  know  that  b  =  3  leads  to  CP,  b  =  4 

and  b  =  5  lead  to  non -degene rate  SMXQ,  SMA,  and  b  =  6  leads  to  CS. 


Figs.  B.IO,  B.ll  and  B.l2  show  respectively  the  probability  of 
blocking  PB,  the  channel  utilization  p(l  -  PB) ,  the  normalized  average 
message  delay  yCT,  obtained  with  the  four  schemes.  With  respect  to 
blocking  and  utilization,  the  optimal  b  (i.e.,  the  optimal  scheme)  is  a 
function  of  p.  We  note  that  for  small  values  of  p,  b  =  6  (i.e.,  CS)  is 
optimal;  as  p  increases  b  =  5,  then  b  =  4  (i.e.,  (SMXQ,  SMA)  become 
optimal,  and  finally,  for  a  larger  p,  b  =  3  (i.e.,  Q*)  becomes  optimal. 
With  respect  to  the  average  delay,  it  is  an  increasing  function  of  b. 
Ihere  is,  consequently,  a  tradeoff  between  the  probability  of  blocking 
and  the  system  delay.  Therefore,  the  selection  of  a  particular  scheme 
should  account  for  these  two  variables  as  well  as  the  load  on  the 
system. 

B. 7  Conclusion 

In  this  study,  we  considered  various  schemes  for  sharing  a 
pool  of  buffers  among  a  set  of  coimnunication  channels  in  a  communication 
network  environment.  Five  sharing  schemes  are  examined  and  the  results 
of  the  analysis  are  presented  and  displayed  in  a  fashion  which  permits 
one  to  establish  the  tradeoffs  among  blocking  probability,  utilization, 
throughput  cind  delay. 

We  found  that  no  one  scheme  is  always  optimal;  one  should 
select  a  scheme  to  fit  the  particular  operational  environment.  This 
study  shows  that,  in  general,  sharing  with  some  restrictions  on  the 
contention  of  space  is  certainly  more  advantageous  than  no-sharing, 
especially  when  little  storage  is  available. 
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■1 
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p(1  -  PB) 
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APPENDIX  C 

PROPOSITION  6.1  AND  ITS  PROOF;  ' 

It 

APPLICATION  TO  SOME  SPECIAL  CASES 


C. 1  Optimal  Clustering  Structure 

Proposition  6.1 

Given  m,  the  number  of  levels  in  the  hierarchy,  and  assuming 
that  >  1  for  all  i  =  1,  ...,  m,  then  the  solution  of  the  optimization 
problem  6.11,  i.e.,  the  optimal  clustering  structure,  is  such  that: 

Ca)  All  clusters  at  the  same  level,  k  =  1,  ...,  m,  are  composed 
of  an  equal  number  of  lower  level  clusters,  i.e.,  all  nodes  at  the  same 
level  in  the  tree  representation  are  of  equal  degree.  The  optimal  degree 
vector  reduces  to  an  m-dimensional  vector,  n  =  (n^,  1X2,  ...»  n^) ,  whose 
components  are  the  solution  of  the  following  set  of  difference  equations: 


n,  (i  ,i  1 ,  . . . ,  i,  ,)  =  n,  for  all  k  =  1,  . . . ,  m 
k^  m'  m-1  k+r  k  ’  ’ 

and  Cl  ,  ...,  1. 

^  m’  ’  k+1) 


n,  = 


1  n.,n  M  ...  n 
2  3  m 


k-1 

n  (a,  -  1) 


k-1 


"k  = 


n  a.  k 
1  ■ 


i=l 


\  •'k 


m 


i=l 


n  n 

Li=k+1 


k+1 


(6.12) 


k  —  2,  3,  .**1  ^ 


m 


where  by  convention  JJ  n.  =  1 

i=m+l 


•  r 


t 


r 


if' 


» 


9 


9 
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9 


During  the  course  of  the  proof,  we  will  first  fix  a  set  of 
variables  and  carry  the  optimization  step  over  the  subset  of  non- fixed 
variables.  Then  we  will  replace  the  optimal  values  of  the  non-fixed 
variables  in  the  objective  function  and  repeat  the  procedure  until  we 
exhaust  all  of  the  variables.  Note  that  the  optimal  values  of  the 
non-fixed  variables  will  be  expressed  in  terms  of  the  fixed  variables. 
Step-by-step,  the  above  strategy  will  proceed  as  follows: 

1.  k  1 

2.  Fix  all  variables  n.(i  ,  ...,  i.  ,)  ¥j  =  k+1,  ...,  m 
Solve  the  optimization  problem  with  respect  to  the 
variables  n^^Ci^,  ..., 
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3.  Replace  the  nj^(  )'s  by  their  optimal  values  in  the 
objective  function. 

4.  If  k  =  m  Stop 

Else  k  k  +  1  Go  to  Step  2 

Note  that  when  k  =  m,  there  are  no  fixed  variables,  and  the 
optimization .  is  carried  out  with  respect  to  n^. 

To  introduce  some  of  the  ideas  incorporated  in  the  general 
proof,  we  shall  first  solve  for  m  =  2. 


Optimality  for  m  =  2 

The  optimization  problem  becomes: 


ot. 


min:  G(2,  n,  a,  6)  =  [nj(i2)]  ^  +  [^2"2^ 


a-, 


over :  n 


n. 


constraint:  52  "i (i^)  =  N  ;  n  >  0 


Claim  C.l 


According  to  Proposition  6.1,  the  solution  of  the  above  problem 


must  be  such  that: 


Hence 


1 

r  X  1  ^ 

aj[nj(i2]  -  X  =  0  =>  n^Ci^)  =  [-J 

I2  ™  1  j  •  ■  “  >  n^ 

which  means  that  all  n^Ci^) 's  are  equal;  therefore,  from  the 
constraint : 

N 

ni(i2)  =  —  ¥  i2  =  1,  n2  (notice  nj(i2)  > 


Step  2 

n2  is  now  a  variable,  and  the  problem  becomes: 

“1  ^‘“l  “2 

min  G  =  N  (n2)  +  (32^2) 

over:  "2  ^  0 

Since  a^,  (x^  are  strictly  greater  than  one,  the  objective  is 

jp 

function  in  the  region  (n2  >  0).  Then,  if  n2  solution  of  ^ 

* 

such  that  n2  >  0,  it  must  be  the  optimal  solution. 

AC  S  “7”^ 

=  0  =>  (1  -  aj)(N)  ^n2)  +  02(82)  Cn2) 


Hence, 


"2  = 


Oi  -  1  a 

- — N  ^ 


1.02^ 


aj+a2-l 


Notice  n2  >  0. 


By  replacing  n2  in  the  objective  function. 


a_  a,+a.,-l 


G  =  (n2) 


1 —O.  0(|  \Ai  ' 

(N)  ♦  (62)  (02) 


size 


0) 


a  convex 
=  0  is 


0 
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min  G(2,  n,  a,  g)  =  +  a2  -  1) 


Proof: 


Following  the  strategy  outlined  above,  the  proof  will  proceed 


in  two  steps: 


Step  1 


n2  is  fixed.  Solve  the  problem  with  respect  to  nj^(i2)  , 


i2  ■”  i*  2,  n2. 


The  objective  function,  being  a  sum  of  power  functions 
“l 

[nj(i2)]  ,  which  are  convex  in  the  region  {nj(i2)  >  0,  for  all  i's}, 

is  a  convex  function  in  that  region  (recall  >  1).  Taking  the 


Langrangian : 


"2  o  r  "2  1 

LCiJi,  A)  =  2]  [nj(i2)]  -  A  2  ^ 

io=l  Li-,»1  J 


where  n^  =  {nj(i2)}  and  A  ^  0. 


Notice  that  we  discarded  the  constant  term  (821^2^ 
positivity  constraint,  which  we  will  check  a  posteriori. 

The  Langrangian  is  also  a  convex  function  (for  "2(12^  ^  » 

and  at  optimality  n,  must  be  such  that: 


l-a,/a„  +  Qt 


G  = 


(n2) 


1 


a-, 


La2(62) 


1-a, 


rt,  ♦  a,  -  1  a, 

■■  (N) 


which,  after  grouping  the  terms  in  02  at  the  denominator,  results  in 
the  expression  proposed  by  Claim  C. 1. 


General  Case 

In  accordance  with  the  step-by-step  strategy,  we  will  first 
prove  a  lemma  that  provides  the  solution  for  the  optimization  problem 
obtained  after  k  steps. 


Lemma  C.l 


The  optimal  solution  of  the  optimization  paroblem  obtained 
after  fixing  the  variables  •••»  j  =  k  +  1,  ...,  m 


is  such  that 


•••’  Vl^  *  "a  ^  ^^m’  •••'  V’  ^  •••’ 


"il  = 


rA-1 

n  (a,  -  1) 


[m  * 

"2"3  "k  X,"'  X  ,  ” 

^k+2"^ 


il-1  -j 

1  "a+2^*^ 

•  i=l  " 

V  •••’  V2^J 

'  ***  E 

1  i  =1  ^ 

^  11+2 

V-  i  »  2,  3,  ...,  k 

V2(*^ 

-|-1 

.  E  "k+l^^m* 
"k+2=l 

•••’  V2^J 

(C.l) 


and  the  minimum  G  is  expressed  by: 


G(m,  n.  Of,  3)  = 


?  Vl»n,- 

^k+2 


1=1 


D, 


k+1 


n 


m 


E  • 


.E  t^k+l"k+l^^ 


m 


^k+2^^ 


Vl 


^K+2 


m 

+  E  2*  ?* 

j=k+2 


Proof: 

The  proof  will  proceed  by  induction.  Let  us  prove  that  Lemma 
C.l  is  true  for  k  =  1. 


(i)  k  =  1:  All  variables,  n^C...)  j  =  2,  ...,  m 
Thus  the  optimization  problem  (6.11)  reduces  to: 

n  •  •  •  > 

m  2  m  3 

®in  G.  -  •••  5^  ^2^^ 

over:  n,(i  ,  ...»  i_)  >  0 
1  m  c 

s.t. :  size  constraint 


\ 

Since  >  1,  the  above  objective  is  convex  in  the  region  n^ 
Taking  the  Lagrangian,  we  get 


,  are  fixed. 


's  >  0. 


^(]!}l  *  ^  ^  J  ^^m+1  *  •••»  ^2^ 

^m  ^2 

In  the  region,  {n^  >  O}  ,  the  Langrangian  is  a  convex  function,  and  if 
the  solution  of  VL^  =  0,  where  V  denotes  the  gradient  operator  [ZAN  69] 
is  such  that  n^  >  0,  then  it  must  be  optimal. 


By  equating  the  partial  derivatives  to  zero: 


a^En^Ci 


m 


a  -1 

i2l  =  ^ 


V  (i 


m 


^2^ 


This  iinplies  that  all  n^'s  must  be  equal.  Hence,  from  the  size 
constraint. 


"l^^m*  •••.  ^2^  “  "l  *  Y, 


N 


m 


^3 


By  replacing  n^^  in  the  objective  function. 


n^j  (i  >  •  •  •  * 
c  m 


m 

2 

3=2 


Since 


a 


N  ^  and  D<, 


1,  the  above  expressions  satisfy  Lemma  C.l. 


(ii)  General  Case 

Assuming  that  Equations  (C.l)  and  (C.2)  are  true  for 
A  =  1,  2,  ...,  k  ,  with  k  <  m,  let  us  prove  that  they  are  still  true 
for  As  k  +  1. 

According  to  the  step-by-step  strategy,  we  propose  to  solve 
for  step  k  +  1.  Let  all  "j (i^»  •••»  ^j+i^  fixed  ¥  j  =  k  +  2,  ..., 
and  let  us  solve  for 

min:  G(m,  n,  a,  B)  as  given  in  Eq.  (C.2) 
over:  {n,j^j(ij^,  ...,  ij^+25}  '  2k+l  ^ 
m 

Notice  that  the  term,  J]  G,  (m,  n,  a,  S)  in  Eq.  (C.2)  is  a  constant 

j»k+2  ^  ~  ~ 


therefore,  it  will  be  discarded  in  the  optimization  step. 

Claim  C.2 

G(m,  n,  a,  B)  is  convex,  with  respect  to  in  the  region 

>  0.  It  is  sufficient  to  prove  that  Y,  the  first  term  of  Eq.  (C.2) 
is  a  convex  function.  (In  what  follows,  ^k+1^ 

occasionally  will  be  denoted  by  nj^(*).) 

k 


‘k+1^  m 


3Y _ 

(i_»  • • • »  ii 


n  (a.  -  1) 

i=l  ^ 


k 

n(a,-l) 


1)  r  n  (ct. 


L  k+1 

J 

k 

n  (a  -1) 

i=l  ^ 

"k.i'-’j 

•^k.l 

2 

which  we  define  as  3  Y. 

Ihe  same  expression  is  true  for 

Hy  is  equal  to  3  i  (defined  above)  times  a  matrix  of  ones’ s.  Let  us 
prove  that  Hy  is  positive  definite  in  the  region  nj^^^  >  0. 


2k»i»y5k.i  ■  Vi<V 

where  is  the  transpose  of  nj^^j  .  For 

Therefore,  Li  ’  “  Sk.i  ^  “■ 


This  proves  that  Y  is  a  convex  function  in  the  region  of 
interest;  moreover,  since  the  second  term  of  Eq.  (C.2)  is  a  sum  of 
convex  functions,  then  G(m,  n,  a,  0)  is  a  convex  function. 

As  a  consequence  of  Claim  C.2,  if  the  solution,  nj^^j  of  VG  =  0, 
is  such  that  nj^^j  >  0,  then  it  must  be  the  optimal  solution. 


Notice  that  the  left-hand  side  of  this  equation  is  the  summation  of  all 
\+l^^m*  ^k+2^  *  hence,  it  will  be  the  same  for  all  ...,  ’ 
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Consequently,  all  n. 


k»l'^  are  equal.  Let  ....  =  "k.i. 


From  the  above  equation 

o°^+l,_  ..°^k+l  ^  ^  ,q  \+l 

k+l^k+1  ^"k+1^  “  ^  i  ■ 


n  (a.-l) 
i=l  " 


Vi  E  —  E  V2t'’ 

^k+3 


D, 


k+1 


Groining  the  terms  in  n^^^^  to  the  left-hand  side  leads  to  with 

the  following  exponent  (to  be  denoted:  exp  [  ]  )• 


exp  [n.  ,]  =  a.  ,  + 


Vi\ti  *  .n  (“i  - 1) 
1*1  1=1 


‘k.l^  k.l 
Using  Eq.  (6.2) 


k+1 


k+2 


'Vil  ' 


Also,  the  exponent  of  the  multiple  summation  is  equal  to 

k  k 

+  n  Cttt  -  1)  n  at 

^  i=l  ^  i=l  ^ 


k+1 


k+1 


This  is  due  to  the  fact  that: 


Dk+i  -  («i  -  1)  = 


k 

i?i“i 


(C.3) 


The  proof  of  Eq.  (C.3)  results  from  Eq.  (6.13)  as  follows: 


"  ,n  (“i 


- 1)  = 


k-i 

n 

i=l 


+  n  (a.  -  1) 


:[■ 

=  “kVi[\-i  *  .n  c«i  - 


1) 


=  Wi  • 


.  a^iD^  +  ttj  -  1) 


=  n  a. 

i*l  ^ 


Q.E.D 


Finally, 


“k+1 


k+2 


k 

n  a. 
i«»l 
—5: 


=  n  (“i  - 1)  -  e  "wC)]'^ 

\  Vs 


Extracting  nj^^j  would  give  Eq.  (C.  1)  for  £.  =  k  +  1.  Notice  that 

2k+l  ^  \+l^V  *’**  ^k+2^  "k+1  objective 

function,  Eq.  (C.2),  we  arrive  at 


=  ■  "•2  . H.s’l 

1_  ll.  .  *9 


n  (a.-l) 

i=l  ^ 

Di.  1 
k+1 


m  *k+3 
“k+1  “k+1 


E--  E  nv^,Ci„ . Vj) 


"■  Vr"k+1  f-  ■‘k+2'‘"in’ 

^k+3 


m 

+  2^  G.  (m,  n,  a,  B) 

j=k+2  ~  ~  - 


Replacing  n.  , 


above  with  its  expression  given  in  Eq.  (C.l),  leads 


m  vr  ' 


G  =  B 


k+l 


.n(“i  - B 


i=l 


k+l 


\+l  ^k+1 


n  (a  -1) 
i=l  ^ 

\+2 


[P 


"k+2^*^ 


m  k+3 


+  3 


“k+l 

k+l 


n  (“i  -  1) 
i=l 


L“k+1^\+1^ 


i“k+l  D, 


k+l 


k+l 


“k+l  \*1 


k+2 


[?  —  Z  V2f>] 

1  1,  ,  -* 


m  k+3 


+  ^  ^  Gjj(m,  n.  ct,  3 


3) 


The  exponents  X,  Y  are  such  that 

k  k  k 

n  («i  - 1)  ,  n  («i  - 1)  V  n  («i  -  d 

x»  _ 

°k+l  '  °k+l  / 


"k+2 


n  («.  -  1) 


-  *  i  k 

^r“D —  \\*i*  n  («i  - 1)  -  D  2] 
V1V2  L  ^  i=l  ^ 


Also, 


\*2  “  “k+l'^k+l  *  f“i  " 
1=1 


Thus, 


X  = 


k+l 

n  (a.  -  1) 

i=l  ^ 

°k+2 
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As  for  Y,  we  have 


Y  =  1 


/  11  r 


n  (“i  -  i)> 

•  I  * 


k+l 


'“k+1  D, 


k+l 


k+2 


By  performing  the  same  manipulations  as  for  X, 

k 


n  («.  -  1) 

i=l  ^ 


D, 


k+2 


Consequently,  X  =  Y;  this  property  is  the  key  to  the  proof.  It  says 
that  at  each  optimization  step  we  get  one  less  term  in  the  objective 
function. 

To  prove  that  Eq.  (C.2)  is  satisfied  for  ?•  =  k  +  1,  there 
remains  to  be  shown  that  the  coefficient  of 


^k+3^] 


is  equal  to  ^  coefficient,  then 


n  (a  -1) 
i=l  ^ 


U  = 


n  (a.  -  1) 


i=l 


k+l 


L\+l^^k+l^ 


“k+l  \+l 


k+2 


m' 

•x-i 


®k+l  \+l 


n  (a.  -  1)  » 

i=l  ^  ®k+l 


L“k+l^^k+P 


“k+l  “^k+l 


Then,  the  term  in  the  second  bracket  becomes 

k 

n  ~  B  D  B 

B  +  i=l  k-«-l  ^  k-t-2  k+l 

“k+1  \+l  “k+1  ^k+1 

After  substitution. 


n  (a.-l) 
i=l  ^ 


U  =  D. 


n  (a.  -  1) 

Vi 


n  (a.-l) 
i=l  ^ 


Ot  D 


.^/“i  “  a.  D, 
1=1  k+1  k+1 


Therefore,  U  =  ^jf+2  given  in  Eq.  (6.14).  This  terminates  the  proof 
of  Lemma  C.  1. 

Lemma  C. 1  inplicity  assumed  that  k  £  m  -  1.  Let  us  show  that 
Eqs.  (C.l)  and  (C.2)  are  still  true  for  £  =  k  =  m.  Observe  that  for 
£  =  k  =  m  -  1 


n 

m 

X, 

•••  X  , 

Furthermore,  by  convention,  we  set  the  value  of  the  above  sum 
to  be  1  when  £  =  m.  Consequently,  Eqs.  (C.l)  and  (C.2)  become  mean¬ 
ingful  for  £  =  m.  There  remains  to  be  shown  that  these  equations 
effectively  hold  true  at  that  value  of  £  =  m. 

For  £  =  k  =  m  -  1,  Eq.  (C.2)  becomes: 
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r  .1 

•■I 
:  .1 


P 


1 

,4 

r 


G(m,  n,  a,  6)  =  B^[n^] 


m-1 

n  (a.-l) 
i=l  ^ 


m 


a_ 


+  [3  n  ] 
^  m  m 


m 


(C.4) 


The  optimization  will  now  be  performed  over  the  single 

variable,  n^.  as  given  in  Eq.  (C.4),  is  convex  in  the  region 

{n  >  o}  .  Again,  if  the  solution  n  of  VG  =  0  is  such  that  n  >  0, 
^  m  ^  ®  m  m 

then  it  must  be  the  optimal  solution. 


=  0  =>  a  (3  )  [n  ]  =  n  (a-'l)  tt  1 

m  Li=l  J  m 


m 

n  (a  -1) 
i-1  ^  -  1 

D 

m 


Solving  for  n_, 
m 


n  = 
m 


ot  (3  ) 


a  D 
m  m 


D 

m  _ m 

-  »  B  ■ 

1=1  m 


D 


m+1 


This  result  satisfies  Eq.  (C.  1)  for  il  =  k  =  m,  where  the  summation  is 
set  ot  one. 

Also  we  can  easily  check  that  the  objective  function  becomes 


G(m,  n,  a,  3)  =  B  (C.5) 

~  ~  m+ 1 

since  it  is  the  last  step  in  the  optimization.  The  above  objective 
is  actually  the  minimum  computational  cost.  It  also  checks  with  the 
expression  in  Eq.  (C.2)  where  the  summations  are  set  to  one. 


End  of  Proof  of  Proposition  6.1 

From  Lemma  C.l  and  the  extension  of  the  expresssion  in  Eq. 
(C.  1)  to  £.  =  m,  we  know  that 
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Thus,  the  multiple  summation  in  Eq.  (C.l)  becomes, 

"m  "ll+2  m 

■  S  ***  .51  "Ul^^m’  ^  Vm-l  "n+l  "  •  ?  , 

The  substitution  of  the  above  summations  in  Eq.  (C.l)  results  in 
Eq.  (6.12).  Also,  Eq.  (C.S),  giving  minimum  G,  is  exactly  what  we 
have  to  prove,  i.e.,  Eq.  (6. IS). 


lication  to  Other  Snecial  Cases 


This  section  deals  with  the  following  two  cases: 


(i)  Uniform  design  strategy,  variable  gate  assignment,  i.e.. 


Oj^sa  ^fk=l,  ...,m 


6j^*s  variables 


(C.6) 


(ii)  Proportional  assignment  of  gates. 

In  (i),  explicit  expressions  for  the  optimal  degree  vector  and 
coaf>utational  cost,  given  m,  have  been  derived.  Some  partial  results 
related  to  the  global  optimality  have  been  found  when  is  of  the  form, 
Sfc  =  Y'‘■^  k  i  2- 

With  respect  to  (ii),  the  nujiter  of  level  gates  to  be 

St 

selected  is  proportional  to  the  nun^er  of  k-1  level  gates  from  which 
they  are  selected.  The  corresponding  solution  was  found  to  be  of  no 
practical  interest.  The  solution  is 


+  00 


n_,  ....  n  ,  any  value  different  from  0  or  o®,  s.t.  TT  =  N- 
2  m-1  •'  .  ,  1 


C.2.1  Uniform  Design  Strategy,  Variable  Gate  Assignment 

TTie  purpose  of  this  section  is  to  model  design  requirements 
whereby  different  reliability  constraints  are  imposed  on  the  design  of 
the  layer  subnets  depending  on  their  levels  in  the  hierarchy.  However, 
the  same  design  procedure  is  applied  at  all  levels. 

C.2.1.1  Explicit  Expressions 
Corollary  C.l 

Under  the  conditions  of  Proposition  6.1,  and  assuming  that  all 
the  are  equal,  the  optimal  solution  is  such  that 


(g-U-y-* 


g  r/g  -  1\”  J  D  , 

n,  =  - r  I — r — I  N  m+1 

k  a-l|_\a  /  J 


m  ...m+k-i,.^  ,.i-k-l  k-1  1^-i T,in-k+i-ln 

n  cB.)“  n  (B.f 

i=k+l  ^  i=l  ^ 


,aDm 


m+1 


k  “  1,  2,  ...,  m 

(C.7) 

With  this  optimal  assignment,  the  minimum  computational  cost  is 


G  =  D 


m+1 


”  -"**-‘(g-l)‘-‘ 


(m  I 

n/? 

1*2 


m  m- 1  _  ,  D 


m+1 


(C.8) 
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Proof: 

From  Corollary  6.1  (Eqs.  (6.21)  and  (6.22)  we  know  that  Eqs. 
(C.7)  and  (C.8)  are  true  with  regard  to  the  terms  containing  a  and  N. 
The  term  containing  6  in  Eq.  (C.7)  can  be  directly  obtained  from  Eq. 
(6.19).  The  rest  of  the  proof  of  Eq.  (C.8)  proceeds  by  induction 
exactly  in  the  same  way  as  in  the  proof  of  Eq.  (6.21). 


Limiting  behavior  when  a  ->•  <» 

From  Eqs.  (6.23)  and  (C.7) 

limit  n.  =  ^ 
a  +00  Pr 


m 


N  n  e. 


1/m 


(C.9) 


k~  1}  2y  ...j  m 

Since  gj^  =  Eq.  (C.9)  implies  that  at  the  limit  all  layer  subnets 

must  be  of  equal  size. 


C.2.1.2  A  geometrically  monotonic  gate  assignment  strategy 

In  order  to  reach  any  conclusion  with  regard  to  global 
optimality,  it  is  necessary  to  specify  in  terms  of  k.  Let  3j^  be  a 

geometrically  monotonic  function  of  k,  i.e., 

3^  *  Y*'"^3  k  =  2,  3,  ....  m  (C.IO) 


Y  =  1  corresponds  to  the  uniform  gate  assignment  strategy  studied  in 
Section  6.5.  With  the  above  gate  assignment  strategy,  Eqs.  (C.7)  and 
(C.8)  become: 


Eq.  (C. 14)  is  quite  similar  to  Eq.  (6.30),  but  as  we  will  see  later. 


mj  is  not  the  optimal  number  of  levels  (except  when  Y  =  1) . 

(ii)  In  order  to  satisfy  the  gate  constraint,  Eq.  (6.6),  the 
degree  vector,  must  be  such  that 


n^s^B  >  k»2,...,m 


(C.15) 
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The  above  condition  will  always  be  satisfied  if  m  ^  and  y  >  1. 
Other  situations  may  be  checked  numerically. 


Global  Optimalit) 


/  a  -  1 

Under  the  conditions  Ny/6  >  1  and  y  >  \  "  q~'  )  >  the  optimal 

number  of  levels,  m*,  is  the  unique  positive  root  of  the  equation: 

«['" >" -  (i"  *  “['  - ( V-)”]'" ^ 

(C.16) 

The  corresponding  optimal  degree  is 


/  g-l  In  y 

g  '  In  to/ (a- 1)) 


-  I  In  y 

a  ,  a  ”  \  a  /  lnCa/(a- 


k  ™  2,  m^ 


Notice  that  the  expression  of  n^  is  independent  of  N. 


Proof- 


(C.17) 


Differentiating  Eq.  (C. 12)  with  respect  to  m,  we  arrive  at 


S  ^  .  1„  !^)i„  -2^  .  1„  Tfl 

^m+l  ^  -I 

The  roots  and  the  sign  of  the  above  equation  can  be  determined  from  the 

study  of  Eq.  (C.16)'  Fig.  C.l  shows  a  straight  line  for  the  left-hand 

side  expression  of  Eq.  (C.16)  and  a  monotonically  increasing  (y  >  1) 

or  decreasing  (y  <  1)  curve  for  the  right-hand  side  of  Eq.  (C.16) 


1 

9 


It  also  shows  a  unique  intersection  point  m*.  From  the  relative 
positions  of  those  curves,  we  conclude  that  m*  leads  to  a  minimum 
value  of  G. 


• 

t-  i  I 


X  =  LEFT  HAND  SIDE  OF  EQ.  (C.16) 
Y  =  RIGHT  HAND  SIDE  OF  EQ.  <C.16) 

Yq  =  (Lg  Ny/p)  (Log  (v/(a  -  1) ) 

Yoo  =  Yq  +  £v  Log  7 


Figure  C.1.  Comparative  Behavior  of  the  Two  Sides  of  Eq.  (C.16). 


C.2.2  Proportional  Gate  Assignment 

In  certain  situations,  it  is  imijortant  to  make  the  number  of 

gates  selected  at  any  level  sensitive  (e.g.,  proportional)  to  the 

number  of  nodes  of  the  set  from  which  they  are  selected.  Let  y  (y  < 

Ic  Jc 

be  the  fraction  of  the  level  gates  to  be  selected  as  k+1^^  level 
gates,  and  let  y  -  (Yj»  Y2»  •••»  Yj^^)  •  From  the  above  definitions,  the 


••ii 

■-■.3 

If* 


degree  and  the  size  vectors  are  such  that 


•••  ^2^ 


thus , 


*•*’  k+P 


\^^m’  •••’  ^k+1^ 

X,  Yk-l^k-l^^m’  •••’  ^k^ 

k  =  2.  ., 


"k=l 


k+r 


•’  ^k+1^ 


The  general  expression  of  the  computational  cost  of  all  k^^ 
subnets  is 

Vlf%*  ^k+2^  a, 

i  =1  i,  ,=1  ™  ^  ^ 

m  k+I 


m 

G^(m,  n,  ot,  Y)  = 


Also,  Eq.  (6.9)  still  holds  true. 

Substituting  Eq.  (C.19)  into  Eq.  (C.20)  for  k  =  m, 


m 


r  n 

m  /  m-1  V 

X,  13  n  Yj  n  (i . i 

i=l  i.=l'j=l  ^ 

L  m  2  •' 


.a 


^2^ 


m 


(C.18) 

.  ,  m 

m’  •••’  ^2^ 

(C.19) 

level  layer 

^  (C.20) 

we  find 


Consequently,  if  n  satisfies  the  size  constraint,  Eq.  (2.1) 


Hence  G(m,  n,  a,  y)  i 

Therefore,  if  for  a  particular  feasible  solution  n,  G  is  equal  to  its 
lower  bound  given  above,  then  that  vector  n  is  optimal. 


rm-i 


m 


¥  n  feasible 


(C.21) 


Proposition  C. 2 

Under  the  proportional  gate  assignment  and  for  >  1, 

(k  =  1,  . . . ,  m)  and  a  fixed  m,  an  optimal  solution  of  Problem  6.11 


is  n*. 


nj  =  0 


n*  =  + 
m 


n*.  n-. 


m 


’  "m-1  ^  ^  n  n?  =  N 


i=l 


(C.22) 


With  this  assignment 


m-1  vtt  a 


(Jll-X  vU  I 

Yjj N 


m 


Proof: 


Let  n  be  a  feasible  vector,  and  such  that 


\^^m’  ^k+1^  ""k  ¥k  =  l,...,m-l 


n  n  =  N 
k=l  k 


(C.23) 


From  Eqs.  (C.19),  (C.20)  and  (C.23),  we  jet 
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G(in,  n,  a,  y) 


a  -] 

=  N(nj) 


m-1  ,  k-1  .a,  /  m  \l-a.  /m-1  a  a 

E  N  n  Yj  ^  n  n,  ^  nY4"N" 

k=2  V  i=l  U=k+1  'i=l 


For  n  equals  n*,  (Eq.  (C.22)) 

/  ”  \^"“k 

limit  [IT  =0  k  =  2,...,m-l 

n  U=k+1 
m 


(m-1  .a  a 

n  Y^j  N 

Then,  because  of  Eq.  (C.21),  n*  is  optimal. 

For  a  realistic  network,  this  optimal  solution  is  meaningless, 
and  it  is  necessary  either  to  keep  the  integer  constraint  on  n  or  to 
introduce  other  constraints. 
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